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O (54) Title: UNC-5 CONSTRUCTS AND SCREENING METHODS 

O ( 57 ) Abstract: The invention provides novel splice variants of the human unc-5c cDNA and a novel human unc-5HSl cDNA se- 
£^ quence. Also provided are assays based on protein-protein interactions between the UNC-5 protein and a variety of different inter- 
^* acting proteins. 
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UNC-5 constructs and screening methods 

The present invention is concerned with unc-5, a 
conserved animal gene family that encodes proteins 
5 implicated in directional cell behaviour. In 

particular, the invention is concerned with novel 
splice variants of the human unc-5C cDNA and a novel 
human unc-5HSl cDNA sequence. In addition, assays are 
provided based on protein-protein interactions between 

10 the UNC-5 protein and a variety of different 
interacting proteins. 

Unc-5 is a conserved animal gene family that 
encodes proteins implicated in directional cell 
behaviour. The unc-5 gene of the nematode worm 

15 Caenorhabditis elegans (C. elegans) is known to be 
involved in dorsal migration in contrast to unc-40 
which is involved in ventral migrations (Hedgecock et 
al., Neuron Vol. 2; 61-85, 1990). Both the unc-5 and 
unc-40 genes are associated with the netrin unc-6, and 

20 all three genes play a dominant role in directional 
neuronal outgrowth . 

The C. elegans unc-5 gene encodes a 919 amino 
acid transmembrane receptor with two immunoglobulin 
and two thrombospondin type I extracellular domains 

25 (Leung-Hagesteijn et al.. Cell Vol. 71:289-299, 1992). 
Ectopic overexpression of unc-5 in the C. elegans 
touch neurons resulted in dorsal steering of these, 
instead of the normal ventral elongation of these 
neurons (Hamelin et al.. Nature, 364:327-330, 1993). 

30 Several vertebrate homologues of unc-5 have been 

cloned including the Rattus norvegicus uncSHl and 
unc5H2 (Leonardo et al., Nature Vol. 386:833-838, 
1997), a Mus musculus homologue designated rem 
(Ackerman et al, Nature Vol. 386:838-842, 1997) and a 

35 human homologue unc5C (Ackerman et al., Genomics Vol. 
52:205-208, 1998) . 
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The intracellular part of the UNC-5 proteins 
contains a ZO-1 domain. Such domains are known to be 
involved in tight junction biology. Furthermore UNC-5 
proteins contain a death domain. So far this is the 
5 only protein found in C. elegans that harbors such a 
death domain. Death domains are involved in the 
apoptotic process. In this process, caspases play an 
important role. The human UNC-40 homologue DCC, a 
protein also known involved in axonal outgrowth, is a 
10 caspase-3 substrate (Mehen et al., Nature 395:801-804, 
1998). 

The present inventors have identified three 
previously unknown variant unc-5C cDNAs . These 
variant cDNAs correspond to alternatively spliced unc- 
15 5C transcripts. 

Accordingly, in a first aspect provides a protein 
which comprises the amino acid sequence set forth in 
SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID NO: 6 or an amino 
acid sequence which differs from that shown in SEQ ID 
20 NO: 2, SEQ ID NO: 4 or SEQ ID NO: 6 only in 
conservative amino acid changes. 

Also provided by the invention are nucleic acid 
sequences which encode the proteins of the invention. 

Also provided by the invention are a nucleic acid 
25 comprising the sequences of nucleotides set forth in 
SEQ ID NO: 1, a nucleic acid comprising the sequences 
of nucleotides set forth in SEQ ID NO: 3 and a nucleic 
acid comprising the sequences of nucleotides set forth 
in SEQ ID NO: 5. 
30 The splice variants of human unc-5C were cloned 

by PCR technology. Two primers were developed to 
amplify the intracellular part of the unc-5C. Human 
Brain cDNA was used for this purpose. Three new splice 
variants of human unc-5C were characterized. A 
35 schematic representation of these splice variants is 
given in Figure 5. 

The first splice variant (designated unc-5Cb) has 
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a deletion of an intron in the UP region. The 
nucleotide sequence of a partial unc-5Cb cDNA is set 
forth in SEQ ID NO: 1 and the corresponding amino acid 
sequence is set forth in SEQ ID NO: 2. The splice of 
5 this intron results in a UNC-5Cb protein which is 

considerably shorter than the previously known UNC-5C, 
as the coding frame is not maintained. This protein is 
truncated for the DD domain and for the major part of 
the UP domain. 

10 The second splice variant (designated unc-5Cc) is 

deleted by an intron in the ZO-1 region, also 
resulting in a shorter protein than the previously 
known UNC-5C, as the coding frame is not maintained. 
The nucleotide sequence of a partial UNC-5CC cDNA is 

15 set forth in SEQ ID NO: 3 and the corresponding amino 
acid sequence is shown in SEQ ID NO: 4. The resulting 
protein (UNC-5Cc) is truncated for the DD domain, the 
UP domain and a part of the ZO-1 domain. 

The third splice variant (unc-5C8) is deleted by 

20 a small intron in the ZO-1 domain, but the coding 
frame is maintained. This results in a slightly 
smaller protein (UNC-5C8), wherein only the amino acid 
sequence coded by the spliced intron is truncated. 
The nucleotide sequence of a partial UNC-5C8 cDNA is 

25 set forth in SEQ ID NO: 5 and the corresponding amino 
acid sequence is shown in SEQ ID NO: 6. 

The presence of various splice variants of unc-5C 
in the human brain indicated that the activity of 
UNC-5C is tightly regulated. 

30 The inventors have also identified a human unc-5 

cDNA which shares homology with the Rattus norvegicus 
unc-5Hl cDNA. 

Accordingly, in a further aspect the invention 
provides a nucleic acid molecule comprising the 

35 sequence of nucleotides set forth in SEQ ID NO: 7. 

Whilst performing yeast two hybrid experiments to 
identify proteins which interact with the human UNC-5C 
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protein the inventors identified a number of 
heretofore unknown human cDNAs which encode proteins 
which interact with human UNC-5C . 

Accordingly, the invention further provides 
5 a nucleic acid comprising the sequence of nucleotides 
set forth in SEQ ID NO: 56 and a sequence of 
nucleotides complementary to the sequence of 
nucleotides set forth in SEQ ID NO: 57, a nucleic acid 
comprising the sequence of nucleotides set forth in 

10 SEQ ID NO: 54 and a sequence of nucleotides 

complementary to the sequence of nucleotides set forth 
in SEQ ID NO: 55, a nucleic acid comprising the 
sequence of nucleotides set forth in SEQ ID NO: 61 and 
a sequence of nucleotides complementary to the 

15 sequence of nucleotides set forth in SEQ ID NO: 62 
and a nucleic acid comprising the sequence of 
nucleotides set forth in SEQ ID NO: 63 and a sequence 
of nucleotides complementary to the sequence of 
nucleotides set forth in SEQ ID NO: 64 , 

20 The nucleic acid molecules according to the 

invention may, advantageously, be included in a 
suitable expression vector to express the proteins 
encoded therefrom in a suitable host. Incorporation 
of cloned DNA into a suitable expression vector for 

25 subsequent transformation of said cell and subsequent 
selection of the transformed cells is well known to 
those skilled in the art as provided in Sambrook et 
al. (1989), molecular cloning, a laboratory manual, 
Cold Spring Harbour Laboratory Press. 

30 An expression vector according to the invention 

includes a vector having a nucleic acid according to 
the invention operably linked to regulatory sequences, 
such as promoter regions, that are capable of 
effecting expression of said DNA fragments. The term 

35 "operably linked" refers to a juxtaposition wherein 
the components described are in a relationship 
permitting them to function in their intended manner. 
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Such vectors may be transformed into a suitable host 
cell to provide for expression of a protein according 
to the invention. Thus, in a further aspect, t-hp 
invention provides a process for preparing proteins 
5 according to the invention which comprises cultivating 
a host cell, transformed or transfected with an 
expression vector as described above under conditions 
to provide for expression by the vector of a coding 
sequence encoding the protein, and recovering the 

10 expressed protein. 

The vectors may be, for example, plasmid, virus 
or phage vectors provided with an origin of 
replication, and optionally a promoter for the 
expression of said nucleotide and optionally a 

15 regulator of the promoter. The vectors may contain 

one or more selectable markers, such as, for example, 
an antibiotic resistance. 

Regulatory elements required for expression 
include promoter sequences to bind RNA polymerase and 

20 to direct an appropriate level of transcription 

initiation and also translation initiation sequences 
for ribosome binding. For example, a bacterial 
expression vector may include a promoter such as the 
lac promoter and for translation initiation the Shine- 

25 Dalgarno sequence and the start codon AUG. Similarly, 
a eukaryotic expression vector may include a 
heterologous or homologous promoter for RNA polymerase 
II, a downstream polyadenylation signal, the start 
codon AUG, and a termination codon for detachment of 

30 the ribosome. Such vectors may be obtained 

commercially or be assembled from the sequences 
described by methods well known in the art. 

Nucleic acid molecules according to the invention 
may be inserted into the vectors described in an 

35 antisense orientation in order to provide for the 

production of antisense RNA. Antisense RNA or other 
antisense nucleic acids, including antisense peptide 
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nucleic acid (PNA), may be produced by synthetic 
means . 

In accordance with the present invention, a 
defined nucleic acid includes not only the identical 

5 nucleic acid but also any minor base variations 

including in particular, substitutions in cases which 
result in a synonymous codon (a different codon 
specifying the same amino acid residue) due to the 
degenerate code in conservative amino acid 
10 substitutions. The term "nucleic acid sequence" also 
includes the complementary sequence to any single 
stranded sequence given regarding base variations- 

The nucleic acid sequences according to the 
invention may be produced using recombinant or 

15 synthetic techniques, such as for example using PCR 
which generally involves making a pair of primers, 
which may be from approximately 10 to 50 nucleotides 
to a region of the gene which is desired to be cloned, 
bringing the primers into contact with cDNA, or 

20 genomic DNA from a human cell, performing a polymerase 
chain reaction under conditions which brings about 
amplification of the desired region, isolating the 
amplified region or fragment and recovering the 
amplified DNA. Generally, such techniques are well 

25 known in the art, such as described in Sambrook et al. 
(Molecular Cloning: a Laboratory Manual, 1989) . 

The nucleic acids according to the invention may 
carry a revealing label. Suitable labels include 
radioisotopes such as 32 P or 35 S, enzyme labels or 

30 other protein labels such as biotin or fluorescent 
markers. Such labels may be added to the nucleic 
"acids or oligonucleotides of the invention and may be 
detected using known techniques per se. 

The protein according to the invention includes 

35 all possible amino acid variants encoded by the 
nucleic acid molecule according to the invention 
including a protein encoded by said molecule and 
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having conservative amino acid changes. Proteins or 
polypeptides according to the invention further 
include variants of such sequences., including 
naturally occurring allelic variants which are 
5 substantially homologous to said proteins or 

polypeptides. In this context, substantial homology 
is regarded as a sequence which has at least 70%, 
preferably 80 or 90% and preferably 95% amino acid 
homology with the proteins or polypeptides encoded by 

10 the nucleic acid molecules according to the invention. 
The protein according to the invention may be 
recombinant, synthetic or naturally occurring, but is 
preferably recombinant. 

A further aspect of the invention provides a host 

15 cell or organism, transformed or transfected with an 
expression vector according to the invention. The 
host cell or organism may advantageously be used in a 
method of producing protein, which comprises 
recovering any expressed protein from the host or 

20 organism transformed or transfected with the 
expression vector . 

According to a further aspect of the invention 
there is also provided a transgenic cell, tissue or 
organism comprising a transgene capable of expressing 

25 a protein according to the invention. The term 
"'transgene capable of expressing" as used herein 
encompasses any suitable nucleic acid sequence which 
leads to expression of proteins having the same 
function and/or activity. The transgene, may include, 

30 for example, genomic nucleic acid isolated from human 
cells or synthetic nucleic acid, including DNA 
integrated into the genome or in an extrachromosomal 
state. Preferably, the transgene comprises the 
nucleic acid sequence encoding the proteins according 

35 to the invention as described herein, or a functional 
fragment of said nucleic acid. A functional fragment 
of said nucleic acid should be taken to mean a 
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fragment of the gene comprising said nucleic acid 
coding for the proteins according to the invention or 
a functional equivalent, derivative or a non- 
functional derivative such as a dominant negative 
mutant, or bioprecusor of said proteins. 

The protein expressed by said transgenic cell, 
tissue or organism or a functional equivalent or 
bioprecusor of said protein also forms part of the 
present invention. Recombinant proteins may be 
recovered and purified from host cell cultures by 
methods known in the art, including ammonium sulfate 
or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose, 
chromatography, hydrophobic interaction 
chromatography, affinity chromatography, 
hydroxyapatite chromatography and lectin 
chromatography . 

The protein of the present invention may be a 
naturally purified product, or a product of chemical 
synthetic procedures, or produced by recombinant 
techniques from a prokaryotic or eukaryotic host (for 
example, by bacterial yeast, higher plant, insect and 
mammalian cells in culture) . Depending upon the host 
employed in a recombinant production procedure, the 
expressed protein may lack the initiating methionine 
residue as a result of post-translational cleavage. 
Proteins which have been modified in this way are also 
included within the scope of the invention. 

In a still further aspect the invention provides 
an antibody capable of specifically binding to a 
protein according to the invention. Preferably the 
antibody is capable of specifically binding to a 
protein comprising the sequence of amino acids set 
forth in SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID NO: 6. 
An antibody according to the invention may be raised 
according to standard techniques well known to those 
skilled in the art by using the protein of the 
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invention or a fragment or single epitope thereof as 
the challenging antigen. 

A further aspect of the invention comprises a 
nucleic acid capable of hybridising to the nucleic 
5 acids according to the invention, and preferably 

capable of hybridising to the sequence of nucleotides 
set forth in SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 
56, SEQ ID NO: 57, SEQ ID NO: 61, SEQ ID NO: 62, SEQ 
ID NO: 63 or SEQ ID NO: 64, under high stringency 

10 conditions. Conditions of stringency are well known 
to those skilled in the art. 

Stringency of hybridisation as used herein refers 
to conditions under which polynucleic acids are 
stable. The stability of hybrids is reflected in the 

15 melting temperature (Tm) of the hybrids. Tm can be 
approximated by the formula: 

81.5°C+16. 6(log 10 [Na + ]+0.41 (%G&C) -600/1 

20 wherein 1 is the length of the hybrids in nucleotides . 
Tm decreases approximately by 1-1. 5°C with every 1% 
decrease in sequence homology. 

The nucleic acid capable of hybridising to 
nucleic acid molecules according to the invention will 

25 generally be at least 70%, preferably at least 80 or 

90% and more preferably at least 95% homologous to the 
nucleotide sequences according to the invention. 

The present invention also advantageously 
provides oligonucleotides consisting essentially of at 

30 least 10 consecutive nucleotides of a nucleic acid 

according to the invention and preferably from 10 to 
50 consecutive nucleotides of a nucleic acid according 
to the invention, in particular a nucleic acid 
comprising the sequence of nucleotides shown in SEQ ID 

35 NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, 
SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63 or SEQ ID 
NO: 64. These oligonucleotides may, advantageously be 
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-10- 
used as probes or primers to initiate replication, or 
the like. Oligonucleotides having a defined sequence 
may be produced according to techniques well known in 
the art, such as by recombinant or synthetic means. 
5 They may also be used in diagnostic kits or the like 

for detecting the presence of a nucleic acid according 
to the invention. These tests generally comprise 
contacting the probe with the sample under hybridising 
conditions and detecting for the presence of any 

10 duplex or triplex formation between the probe and any 
nucleic acid in the sample. 

To address the functional role of UNC-5 within 
the cell the inventors used the yeast two hybrid 
method (Fields and Song, Nature 340:245, 1989), a 

15 method well known to molecular biologists, both to 

investigate the ability of UNC-5 to form dimers and to 
search for proteins that interact with the UNC-5 
protein. Using the two hybrid approach the inventors 
were able to demonstrate that UNC-5 is capable of 

20 forming homodimers and identified a number of proteins 
which interact with the intracellular domains of the 
C. elegans unc-5 or human UNC-5 proteins. These newly 
identified protein-protein interactions involving UNC- 
5 may represent important events in cellular 

25 signalling, hence compounds which disrupt these 
interactions may potentially have useful 
pharmacological properties . 

Accordingly, in a further aspect the invention 
provides a method of identifying compounds which are 

30 capable of inhibiting or enhancing the binding of an 
UNC-5 protein to an interacting protein previously 
identified as binding to the said UNC-5 protein, which 
method comprises: 

providing a host cell containing a DNA 

35 construct comprising a. reporter gene operatively 

linked to a promoter regulated by a transcription 
factor having a DNA binding domain and an 
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activating domain; 

expressing in said host cell a first hybrid 

ON A cpmionro onrnrli nrr ;=» f i ret fn o -J ^ ^- ^ ^ _ 

~ *■» ^ — -.-.-.w- -~w^w** ^^N^W^-J-il. 

comprising an UNC-5 protein or a fragment thereof 
5 fused in-frame to either the DNA binding domain 

or the activating domain of the said 
transcription factor; 

expressing in said host cell a second hybrid 
DNA sequence encoding a second fusion protein 

1° comprising an interacting protein or a fragment 

thereof fused in-frame to either the DNA binding 
domain or the activating domain of the said 
transcription factor, such that when the first 
fusion protein comprises the activation domain of 

15 the said transcription factor the second fusion 

protein comprises the DNA binding domain of the 
said transcription factor and when the first 
fusion protein comprises the DNA binding domain 
of the transcription factor the second fusion 

20 protein comprises the activation domain; 

contacting the host cell with a sample of 
the compound under test; and 

detecting any binding of the UNC-5 protein 
or fragment thereof to the interacting protein or 

25 fragment thereof by detecting the production of 

any reporter gene product in the said host cell. 

The method of the invention is based upon the 
standard two hybrid assay well known in the art. 
30 Preferably the host cell is a yeast cell. Protocols 

for performing a yeast two hybrid assay are well known 
in the art and are given in the Examples included 
herein. 

As would be readily apparent to persons skilled 
35 in the art, the assay can be performed in either 
orientation. Tha;t is to say, the assay can be 
performed using an UNC-5 protein or a fragment thereof 
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fused to the DNA binding domain of the transcription 
factor and the interacting protein or fragment thereof 
fused to the activation domain of the transcription 
factor or alternatively the assay can be performed 
5 using an UNC-5 protein or a fragment thereof fused to 
the activation domain of the transcription factor and 
the interacting protein or fragment thereof fused to 
the DNA binding domain of the transcription factor. 

The above-described method based on the classical 

10 yeast two hybrid system can be used to screen for 
compounds that inhibit or enhance the interaction 
between two proteins. In addition, other systems have 
been developed to screen for dissociation events, 
these methods are designated reverse hybrid methods. 

15 These systems make use of yeast strains in which the 
expression of interacting hybrid proteins increases 
the expression of a counter-selectable marker that is 
toxic under particular conditions. Under these 
conditions, dissociation of an interaction provides a 

20 selective advantage, thereby facilitating detection: A 
few growing yeast colonies in which hybrids fail to 
interact can be identified among millions of 
non-growing colonies expressing interacting proteins. 
Several reverse hybrid systems are known in the art. 

25 The first reverse two-hybrid system utilizes a 

yeast strain, which is resistant to cycloheximide due 
to the presence of a mutant CYH2 gene. This strain 
also contains the wild-type CYH2 allele under the 
transcriptional control of the GAL1 promoter. 

30 Expression of the wild-type GAL 4 protein is sufficient 
to restore growth sensitivity to cycloheximide. Growth 
sensitivity towards cycloheximide is also restored by 
the co-expression of the avian c-Rel protein and its 
IkB-cx counterpart, p40, as GAL 4 fusion proteins. 

35 Restoration of growth sensitivity towards 

cycloheximide requires the association of c-REL and 
p40 at the GAL1 promoter and correlates with the 
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ability of the c-REL/p40 interaction to activate 
expression from the GAL1 promoter (Leanna and Hannink, 
1996, NAR 24:3341-3347) 

Another reverse hybrid system makes use of the 
5 most widely used counter-selectable marker in yeast 
genetics, URA3, which encodes orotidine-5' -phosphate 
decarboxylase, an enzyme required for the biosynthesis 
of uracil- Yeast cells that contain wild-type URA3 , 
either on a plasmid or integrated in the genome, grow 

10 on media lacking uracil (URA3+ phenotype) . However, 
the URA3-encoded decarboxylase can also catalyze the 
conversion of a non-toxic analogue, 5-f luorooritic 
acid (FOA) into a toxic product, 5-fluoroacil (Boeke 
et al., 1984, Mol . Gen. Genet. 197:345-346). Hence 

15 mutations that prevent an interaction can be selected 
from large libraries of randomly mutated alleles. 
Similarly, molecules that dissociate or prevent an 
interaction could be selected from large libraries of 
peptides or compounds (Vidal et al., 1996, PNAS 

20 93:10315-10320; Vidal et al., 1996, PNAS 
93:10321-10326) . 

A third reversed yeast two hybrid is based on the 
use of GAL 80 gene as relay gene. GAL 80 encodes a 
protein that binds to and masks the activation domain 

25 of a transcriptional activator, such as GAL 4 . The 

reporter genes, which will provide the transcriptional 
read-out (i.e. HIS3 or LACZ) , are dependent upon the 
functional GAL4 for expression. Only when the level of 
GAL 80 masking protein is reduced by interfering with 

30 the two-hybrid interaction will Gal4 function as a 
transcriptional activator, providing a positive 
transcriptional read-out for molecules that inhibit 
the two-hybrid protein-protein interaction. An 
important feature of this reverse two-hybrid system is 

35 that the basal level and the half-time of the relay 
protein, GAL80, can be fine-tuned to provide maximum 
sensitivity (Powers and Erickson, 1996, WO95/26400) . 
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The invention further provides a method of 
identifying compounds which are capable of inhibiting 
or enhancing the binding of an UNC-5 protein to an 
interacting protein previously identified as binding 
5 to the said UNC-5 protein, which method comprises: 
providing a transgenic cell or organism 
expressing a first fusion protein comprising an 
UNC-5 protein or a fragment thereof fused in- 
frame to a first genetically encoded fluorophore 
10 and a second fusion protein comprising an 

interacting protein or a fragment thereof fused 
in-frame to a second genetically encoded 
fluorophore, the first and second fluorophores 
being characterised in that the emission spectrum 
15 of one of the fluorophores overlaps with the 

absorption spectrum of the other fluorophore; 

measuring the amount of fluorescence emitted 
from the fluorophore having an emission spectrum 
which overlaps with the absorption spectrum of 
20 the other fluorophore; 

exposing the transgenic cell or organism to 
a compound under test; and 

detecting any change in the amount of 
fluorescence emitted fluorescence emitted from 
25 the fluorophore having an emission spectrum which 

overlaps with the absorption spectrum of the 
other fluorophore . 

This method uses fluorescence energy transfer or 
30 FRET, a technique well known in the art for the 

detection and quantitative measurement of a whole 
range of specific binding interactions in biological 
systems, to screen for compounds which modulate the 
binding of UNC-5 or a fragment thereof to an 
35 interacting protein. The general principles of FRET 
are as follows: one component of a binding pair is 
labelled with a first fluorophore (hereinafter 
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referred to as the donor fluorophore) and a second 
component of the binding pair is labelled with a 
second fluorophore (hereinafter referred to as the 
acceptor fluorophore) . 
5 It is an essential feature of the FRET technique 

that the fluorescence emission spectrum of the donor 
fluorophore overlaps with the absorption spectrum of 
the acceptor fluorophore, such that when the two 
components of the binding pair bind to each other, 

10 bringing the donor and acceptor fluorophores into 
close proximity, a proportion of the fluorescent 
signal emitted by the donor fluorophore (following 
irradiation with incident radiation of a wavelength 
absorbed by the donor fluorophore) will be absorbed by 

15 the proximal acceptor fluorophore (a process known in 
the art as fluorescence energy transfer) with the 
result that a proportion of the fluorescent signal 
emitted by the donor fluorophore is quenched and, in 
some instances, that the acceptor fluorophore emits 

20 fluorescence. Fluorescence energy transfer will only 
occur when the donor and acceptor fluorophores are 
brought into close proximity by the specific binding 
reaction. Thus, in the presence of a compound which 
disrupts the specific binding, the amount of quenching 

25 is reduced resulting in an increase in the intensity 
of the fluorescent signal emitted by the donor 
fluorophore or a fall in the intensity of the signal 
emitted by the acceptor fluorophore) . 

The method of the invention is an in vivo FRET 

30 assay because it is performed in a transgenic host 
cell or organism. The transgenic cell can be any 
mammalian cell line, the transgenic organism is 
preferably C. elegans. 

The method of the invention uses genetically 

35 encoded donor and acceptor fluorophores which can be 
expressed as fusion proteins fused in frame to the 
UNC-5 protein and the interacting protein. This can 
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be readily accomplished by transforming or 
transfecting the cell or organism with appropriate 
expression vectors arranged to express the fusion 
proteins . 

5 In a preferred embodiment the genetically encoded 

donor and acceptor proteins are variant green 
fluorescent proteins which exhibit different 
fluorescent properties and which have suitably 
overlapping emission/absorption spectra, such as EGFP 

10 (enhanced green fluorescent protein) and EBFP 

(enhanced blue fluorescent protein) . As would be 
readily apparent to persons skilled in the art, the 
FRET assay can be performed in either orientation. 
That is to say, the assay can be carried out using 

15 UNC-5 fused to the donor fluorophore and the 

interacting protein fused to the acceptor fluorophore 
or using UNC-5 fused to the acceptor fluorophore and 
the interacting protein fused to the donor 
fluorophore . 

20 

The invention further provides a method of 
identifying compounds which are capable of inhibiting 
or enhancing the binding of an UNC-5 protein to an 
interacting protein previously identified as binding 
25 to the said UNC-5 protein, which method comprises: 

providing a first reaction component 
comprising a first protein linked to a solid 
support containing a scintillant and a second 
reaction component comprising a second protein 
30 which has been radioactively labelled, wherein 

the first and second proteins are an UNC-5 
protein or a fragment thereof and an interacting 
protein or a fragment thereof; 

bringing the first and second reaction 
35 components into contact in an aqueous solution in 

the presence of a compound under test; and 

detecting binding of the first protein to 
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the second protein by detecting light emission 
from the scintillant. 

The above method is based on the scintillation 
5 proximity assay (SPA™) developed by Amersham and 

commonly used in automated high throughput screening. 
In order to perform this assay a first interacting 
protein (e.g. an UNC-5 protein) must be linked onto a 
bead containing a scintillant. Linking of the protein 

10 to the beads can be carried out in many different 

ways, including, for example, via biotin-streptavidin 
affinity binding. Streptavidin-SPA beads are 
commercially available from Amersham and the 
interacting protein can easily be biotinylated in 

15 vitro or expressed as a biotinylated fusion protein 
using techniques known in the art. The second 
interacting protein (e.g. a protein known to interact 
with UNC-5) is labelled with radioactivity. This can 
be achieved, for example, by synthesising the second 

20 interacting protein by in vitro translation and 
incorporating a tritiated precursor amino acid. 
The SPA™ assay protocol is then as follows: 

SPA beads linked to the first interacting protein 
are incubated for 30 minutes to one hour with a sample 

25 containing the radioactively labelled second 
interacting protein. Upon binding of the two 
interacting proteins, the radioactivity emitted by the 
labelled protein is brought into close proximity with 
the bead containing scintillant and therefore induces 

30 light emission from the scintillant. The free labelled 
protein in sample (non-bound) will not be held in 
sufficiently close proximity to the beads to induce 
light emission. Compounds which disrupt the binding 
of the first and second interacting proteins will 

35 cause a decrease in the amount of light emitted during 
the experiment. 

As would be readily apparent to persons skilled 
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binding of UNC-5 to the interacting protein. The 
interacting protein (or fragment thereof) is labelled 
with a tag which is directly or indirectly detectable, 
typically a fluorescent molecule such as GFP, or a tag 
5 which is detectable by specific antibody binding, such 
as a His-tag or GST-tag. Many other tag molecules 
which are equally suitable for this purpose are known 
in the art and are available commercially. The wells 
are then washed to remove the compound and any 

10 interacting proteins which remain unbound. Any 

interacting protein which has become bound to UNC-5 is 
not removed by the washing step and can be detected 
via the directly or indirectly detectable tag. If the 
interacting protein is labelled with a GFP tag, then 

15 bound proteins are detected by measuring GFP 

fluorescence; if the interacting protein is labelled 
with a His-tag or a GST tag, bound proteins are 
detected with immunological techniques, using an 
antibody of the appropriate specificity. 

20 Compounds which disrupt the binding of UNC-5 to 

the interacting protein will result in more of the 
protein remaining unbound, hence less protein will be 
detected after the washing step. 

25 The invention further provides a method of 

identifying compounds which are capable of inhibiting 
or enhancing the binding of an UNC-5 protein to an 
interacting protein previously identified as binding 
to the said UNC-5 protein, which method comprises: 
30 exposing a cell or organism expressing UNC-5 

and overexpressing nucleic acid encoding an 
interacting protein to the compound under test; 
and 

screening for reversion of the 
35 overexpression phenotype of the cell or organism 

to wild-type. 
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Over-expression of genes encoding for proteins 
which interact with UNC-5 in a cell line or in C. 
elegans results in an over-expression phenotype. 

Assays to select for compounds that inhibit the 
5 interaction of UNC-5 and its interacting proteins can 
therefore be performed in cell lines or C. elegans by 
exposing cells or worms exhibiting an over-expression 
phenotype to the compound under test and screening for 
a 'reduction' of the over-expression phenotype (i.e. 

10 screening for a reversion to wild-type) . 

Over-expression of proteins which interact with 
unc-5 in C. elegans typically results in neuronal 
outgrowth phenotypes, distal tip cell outgrowth 
phenotypes, and other aberrant outgrowth of various 

15 tissues and cells. These phenotypes can be easily 
monitored by expressing reporter genes, such as 
fluorescent proteins in these cells. Reduction of the 
phenotype induced by the over-expression can then be 
monitored by visual inspection. 

20 Simple assays have been developed to screen for 

compounds which cause reversion of the over-expression 
phenotype in cell lines. As Unc-5 receives signals 
from the netrins, over-expression of proteins which 
interact with unc-5 typically causes phenotypic 

25 changes in neuronal outgrowth and cell movement. 

Accordingly, the step of screening for reduction of 
the over-expression phenotype can be performed using a 
laminin assay, a netrin response assay and assays 
using agarose concentration gradients, a boyden 

30 chamber or stratified layers (see Gundersen, R. W., 

Dev. Biol., 1987, 121(2): 423-431; Klostermann, S. and 
Bonhoeffer, F. , 1996, 4: 237-252). In general, these 
methods are based upon providing attractants or 
repellants for axonal guidance in a controlled manner. 

35 The way the cells react to these attractants and 
repellants forms the basis of the assay. In the 
Boyden chamber (upper and lower chambers separated by 
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a filter barrier) one typically cultivates cells in 
the upper chamber and measures how the cells grow 
through the filter. The agarose approach allows the 
establishment of gradients to which the cells react by 
5 forming specific patterns. 

The above-listed methods are all based upon novel 
interactions between an UNC-5 protein and proteins 
shown to physically interact with the UNC-5 protein. 
10 In preferred embodiments, the UNC-5 protein is a C. 
elegans UNC-5 protein or a human UNC-5 protein. 
Preferably the human UNC-5 protein is UNC-5C or any of 
the UNC-5C splice variants identified hereinbefore or 
UNC-5HS1. 

15 The methods of the invention can also be carried 

out using fragments of the UNC-5 protein which retain 
the ability to bind to the interacting protein. 
Preferably the fragment comprises the intracellular 
portion of the protein. Various sub-domains of the 

20 intracellular portion of the protein or combinations 
thereof can also be used. 

As used herein the term "interacting protein" 
encompasses any protein which has been demonstrated to 
interact with an UNC-5 protein. The interacting 

25 protein can be a second UNC-5 protein as the examples 
included herein demonstrate the ability of UNC-5 to 
form homodimers. The interacting protein can also be 
a protein identified as interacting with UNC-5 in a 
yeast two hybrid experiment. A list of proteins 

30 identified as interacting with C. elegans UNC-5 or 

human UNC-5 in a yeast two hybrid experiment is given 
in the Example 4, below. Any of these proteins, or 
fragments thereof which retain a functional UNC-5 
binding site, can be used in the methods of the 

35 invention in combination with the appropriate UNC-5 
protein or a fragment thereof. 

As would be readily apparent to persons skilled 
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in the art, the UNC-5 signalling pathway is highly 
conserved across species. Hence it is to be expected 
that for every interacting protein identified in the 
yeast two hybrid experiments described in the Examples 
5 given herein a homologous interacting protein will be 
found in other species. For example, for every 
interacting protein found in C. elegans to interact 
with the C. elegans unc-5 protein it is expected that 
a homologous interacting protein will be found in 

10 humans and will interact with a human UNC-5 protein, 
and vice versa for interacting proteins first 
identified in humans. Accordingly, it is within the 
scope of the invention to perform the methods 
described above with "homologous combinations'' of UNC- 

15 5 proteins and interacting proteins and even with 

cross-species combinations e.g. C. elegans unc-5 and a 
human interacting proteins, human UNC-5 and a human 
homologue of an interacting protein identified in C. 
elegans ; C. elegans unc-5 and a human homologue of an 

20 interacting protein identified in elegans; C. 

elegans unc-5 and a human interacting protein etc. 
Lists of homologues of the C. elegans and human 
interacting proteins identified in the yeast two 
hybrid study are given in the Examples included 

25 herein. 



In a still further aspect the invention provides 
a method of identifying compounds which reduce or 
inhibit the lethal phenotype associated with the 
30 expression of the UNC-5 death domain in yeast, which 
method comprises: 

exposing a yeast cell containing an 
expression vector comprising nucleic acid 
encoding an UNC-5 protein or a fragment thereof 
35 comprising the death domain to a compound under 

test ; 
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allowing the yeast ceils to grow in the 
presence of the compound; and 

screening for a reduction or Inhibition of 
the lethal phenotype associated with the 
5 expression of the UNC-5 death domain in yeast. 

The UNC-5 protein used in the method of the 
invention is preferably a C. elegans UNC-5 protein or 
a human UNC-5 protein. Preferably the human UNC-5 
10 protein is UNC-5C or any of the UNC-5C splice variants 
identified hereinbefore or UNC-5HS1. 

In a still further aspect the invention provides 
a method of identifying suppressers of the lethal 
15 phenotype associated with the expression of the UNC-5 
death domain in yeast, which method comprises: 

transfecting yeast cells containing an 
expression vector comprising nucleic acid 
encoding an UNC-5 protein or a fragment thereof 
20 comprising the death domain with a cDNA library 

cloned in a yeast expression vector; 

allowing the transfected yeast cells to grow 
for one or more cell divisions; and 

screening for reduction or inhibition of the 
25 lethal phenotype associated with the expression 

of the UNC-5 death domain in yeast. 

Optionally, the method further comprises the 
steps of: 

30 identifying a transfected yeast cell 

exhibiting a reduction or inhibition of the 
lethal phenotype associated with the expression 
of the UNC-5 death domain in yeast; and 

isolating the cDNA clone (s) present in the 

35 transfected yeast cell which is/are responsible 

for conferring reduction or inhibition of the 
lethal phenotype. 
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Again, the UNC-5 protein is preferably a C. 
elegans UNC-5 protein or a human UNC-5 protein. 
Preferably the human UNC-5 protein is UNC-5C or any of 
the UNC-5C splice variants identified hereinbefore or 
5 UNC-5HS1. The cDNA library is preferably a C. elegans 
cDNA library or a human cDNA library. 

The invention will be further understood with 
10 reference to the following experimental examples, 
together with the accompanying Figures in which: 

Figure 1 shows a sequence alignment of the known human 
15 unc-5C cDNA sequence and the three novel alternative 
splice variants of unc-5C. The region of alignment 
corresponds to the portion of the cDNA which encodes 
the intracellular domains of unc-5C. 

20 Figure 2 shows a multiple alignment of unc-5Hl genes. 
ym97dl2 is an EST clone containing a fragment of the 
unc-5HSl cDNA, 3D is a fragment of the unc-5HSl cDNA 
cloned by PCR in Example 2. 

25 Figure 3 summarises the cloning of human unc-5C 
variants . 

Figure 4 summarises the cloning of human unc-5HSl. 

30 Figure 5 is a schematic representation of the human 
unc-5C splice variants. 

Figure 6 shows an alignment between a fragment of the 
protein encoded by the cDNA fragment cloned in pYMP6 
35 and the rat neurexim II-alpha-b cDNA. 

Figure 7 shows an alignment between a fragment of the 
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protein encoded by the cDNA fragment cloned in pYMP17 
and the mouse mena protein. 

Figure 8 is a representation of the vector pGC1037. 
Figure 9 is a representation of the vector pGC1003. 



Example 1 

10 Cloning of the human unc-5C splice variants. 

Splice variants of human unc-5C were cloned, 
primary with RACE technology. 

A 5' RACE was performed using the 5' RACE System 
for Rapid Amplification of cDNA Ends, Version 2.0 
15 (GibcoBRL, Merelbeke, Belgium) , according the 

instructions supplied by the manufacturer or with 
minor modifications thereof. The primers were based on 
the unc-5 EST ym97dl2 . 

The first strand cDNA synthesis was performed with 
20 primer: 

GSPl=oGC75: CGTAGCAGGCACTGGCCTCC 

PCR of dC-tailed cDNA: was performed with the gene- 
specific primer: 

GSP2=oGC7 6 : GCACTGGCCTCCAGCTGGCAGTAG 
25 and the RACE anchor primer supplied with the 5' RACE 
system. 

The PCR Program was: 

Step 1 94 °C, 2 min 
Step 2 94°C, 30 sec 
30 Step 3 60°C, 30 sec 

Step 4 72°C, 2 min 
Repeat steps 2 to 4 for 35 cycles 
Step 5 72°C, 7 min 
Step 6 4°C 



35 



A nested PCR was performed with gene-specific primer: 
GSP3=oGC77 : AGTAGAGGTGGGAGGGCGCCTCCTCGCCCAG 
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and 5' RACE anchor primer 

The PCR program was: 

Step 1 92°C, 2 min 
5 Step 2 92°C, 1 min 

Step 3 68°C, 2 min 
Repeat steps 2 and 3 for 35 cycles 
Step 4 72°C, 7 min 
Step 5 4°C 

10 The resulting RACE products were visualised by 

electrophoresis on agarose gels, the bands excised 
and purified with Jetsorb (Genomed, Germany) . The RACE 
products were ligated into plasmids pAS2 and pGEX-5X-3 
with T4 DNA ligase (Amersham pharmacia biotech, NJ, 

15 USA), or into a TA cloning vector (Invitrogen, 

Groningen, the Netherlands) . Plasmid DNA was purified 
prior to sequencing using the Qiagen plasmid 
purification system (Westburg, Leusden, The 
Netherlands) . 

20 

Example 2 

Cloning of a new human unc-5 gene. 

Human Brain Poly A+ RNA was obtained from 
Clontech, California, USA and first strand cDNA 
25 synthesis performed with the Ready To Go T-Primed 
First-Strand Kit ( (Amersham pharmacia biotech, NJ, 
USA) . 

Primers were: 
for PCR1: 

30 oGC 5 6 : CCGGAATTCCATATGTTAATACTGCCCTTCTGCTGCTAA 
oGC 6 6 : GCGATCTCTGTAGTTGTGGCCTTG 
PCR program was: 

Step 1 94°C, 1 min 
Step 2 53°C, 30 sec 
35 Step 3 72°C, 2 min 

Repeat steps 1 to 3 40 times 
Step 4 72°C, 7 min 
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Step 5 4°C 
for PCR2 

OGC63 : GGGAATTCCATATGTTGTTTGTGTATCGGAAGAATCATC 
5 OGC64: ACGCGTCGACTTAATACTGCCCTTCTGCTGCTAAGGAC 
oGC 6 5 : CCGGAATTCCTTGTTTGTGTATCGGAAGAATCATC 
PCR program was: 

Step 1 94°C, 5 min 

Step 2 92°C f 30 sec 
10 Step 3 55°C, 30 sec 

Step 4 72°C, 2 min 

Repeat steps 2 to 4 for 25 cycles 

Step 5 72°C / 7 min 

Step 6 4°C 

15 

The resulting PCR products were isolated, cloned 
and analysed as described in Example 1. 

SEQ ID NO: 7 shows the sequence of a PCR product 
isolated using the above PCR strategy. This PCR 

20 product was designated clone 3D. Figure 2 shows an 
alignment between the Rattus norvegicus unc-5Hl cDNA 
sequence, the sequence of EST ym97dl2, the sequence of 
clone 3D and the sequences of several other PCR 
products amplified using the above PCR strategy (1G, 

25 IJrc and 2Brc) . 



Example 3 

Cloning of two of the fragments of UNC-5 for the 
30 dimeri zation experiment . 

A PCR amplification was performed with following 
primers : 

UNC5F: GGT GGT CAT ATG GCC ATG GAG TGC TGT AAA CGT GGC 
35 AAT TCA AAA AAG 

UNC5R: GGC TGC AGG TCG ACG CCC CGG GGC TTA TGG GGA CAC 
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AAT TTG TGG 



Using the cDNA library used in the yeast two hybrid 
experiment (Example 4) as template. 

5 

PCR program was : 

Step 1 94°C, 1 min 
Step 2 53°C, 30 sec 
Step 3 72°C, 2 min 
10 Repeat steps 1 to 3 for 25 cycles 

Step 4 72°C, 7 min 
Step 5 4°C 

The resulting PCR products were isolated and 
15 cloned in frame as Ncol/Sall fragments in the vectors 
pAS2 and pGAD424 supplied by Clontech (Palo Alto, 
California, USA) . 



20 Example 4 

Yeast two Hybrid Experiments 

To address the functional role of unc-5 the 
inventors used the yeast two hybrid method (Fields and 
Song, Nature 340:245, 1989), a method well known to 

25 molecular biologists, to search for the proteins that 
interact with the UNC-5 protein. 

The two hybrid method is based on a pair of 
fusion proteins- The first fusion protein comprises a 
first of two interacting proteins fused to the 

30 transcriptional activation domain of a bipartite yeast 
transcription factor; the second fusion protein 
comprises the second of two interacting proteins fused 
to the DNA binding domain of the bipartite yeast 
transcription factor. The principle of the method is 

35 that if the two domains of the bipartite transcription 
factor are physically brought together by binding of 
the first and second interacting proteins then the 
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resulting complex will be able to activate 
transcription from a promoter which contains a target 
binding site for the transcription factor. The tv/c 
hybrid assay is commonly used to study protein-protein 
5 interactions between two known proteins. It can also 
be used to screen a library of proteins to identify 
proteins which interact with a given protein. Both of 
these uses of the two hybrid system are well known to 
those skilled in the art. 

10 In the present invention, the yeast two hybrid 

assay was used to identify proteins which interact 
with C. elegans UNC-5 or human UNC-5 as follows: the 
intracellular part of UNC-5 or parts thereof were 
cloned in fusion with the DNA-binding domain of the 

15 yeast transcription factor GAL 4 . A cDNA library was 
cloned into a vector containing the transcriptional 
activation domain of GAL 4 . The fusion proteins were 
then independently expressed together in yeast 
containing a reporter gene under the transcriptional 

20 control of a promoter containing GAL 4 binding sites 
(typically GAL1 lacZ or GAL1-HIS3) . 

Methods 

(A) Construction of the C. eleaans library and 
25 standard veast two hybrid experiments. 

Construction of C. elegans cDNA libraries, and 
yeast two hybrid experiments with C. elegans cDNA were 
performed as described by Elledge et al., Proc. Natl. 
Acad. Sci. f 1991, 88:1731-1735, or using the 
30 Matchmaker™ maker system supplied by Clontech, 

California, USA according to the protocol supplied by 
the manufacturer, or by minor modifications of the 
above-described methods. 

35 

(B) A mating yeast two hybrid experiment. 

Mating yeast two hybrid experiments were 
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performed using plasmid pGC1037 (a plasmid map of 
pGC1037 is shown in Figure 8 and the complete sequence 
of the plasmid is given in SEQ ID NO: 91) as bait, and 
a .pre-transformed Human Brain MATCHMAKER cDNA library 
5 (Clontech, California, USA) according to the protocol 
supplied by the manufacture, or with minor 
modifications thereof . 

In brief summary, the steps of the method are as 
10 follows :- 

Inoculate 1 colony containing the bait plasmid into an 
overnight culture; 

Mate the bait culture and the library culture (24 h) ; 

Plate library mating mixtures; 
15 Incubate for at least 8 days; 

Streak big colonies onto SD-3 + 5mM AT-plates (+/- 

Nylon Membrane) ; 

Stain yeast on Nylon membrane; 

Prepare yeast DNA from the positives; 
20 Perform restriction digest, if digest is successful 

perform backtransf ormation, using positive and 

negative controls ; 

Transform positives into MC1061 cells; 
Prepare bacterial DNA using Qiagen Plasmid Mini 
25 Purification kit, according to the standard Qiagen 
protocol; and 
Perform DNA sequencing. 

All positives obtained in the yeast two hybrid 
30 screen were assayed for the specificity of the 

interaction (against empty vector and irrelevant 
proteins) using the two hybrid system. 

(C) Double-stranded RNA inhibition-RNAi cloning 
35 isolation and injection. 

Double stranded RNA for RNA inhibition 
experiments was prepared according to the MEGAscript 
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protocol (Ambion, UK) . RNA isolated using this 
protocol was purified away from contaminants using the 

D?sl^> ~ . - -* jz • . - 

t^wuoj ojrotcm j_ium vj-ciy^ii vw^sLourg, ine wetneriands) , 
following the instructions for RNA clean-up supplied 
by the manufacturer. RNA was injected into the 
nematodes using standard procedures (Methods in Cell 
biology, Vol 48, Academic Press, 1995) . 

Results 

(M Auto-a ctivation and dimerization experiments. 

In a first series of experiments, the ability of 
the intracellular domain of C. elegans unc-5 or parts 
thereof to dimerize or to cause auto-activation was 
tested. Several plasmids were constructed harboring 
15 the intracellular domains of unc-5 and parts thereof. 
Various domains of unc-5, including the membrane 
proximal part (MMP) , the zonula occludens homology 
domain (ZO-1), the unknown part (UP) and the Death 
domain (DD) and were cloned in the vectors pAS2 and 
PGAD424 (Matchmaker, Clontech, CA, USA) . The resulting 
vectors are summarized in Table 1. 

Several constructs containing the death domain 
were found to be either toxic or auto-activating. 
Furthermore, by performing homo-dimerization 
25 experiments, it was found that the intracellular 

domain of UNC-5C is capable of forming a homo-dimer. 
Further experiments led to the conclusion that the 
ZO-l/UP region is probably responsible for the homo- 
dimerization. Membrane located signal receptors often 
form homo- or hetero-dimers prior to intracellular 
signal transduction. Accordingly, it is postulated 
that dimer formation in UNC-5 could be a critical 
event in signalling. Based on a knowledge of this 
dimerization it is possible to develop assays to 
35 screen for compounds which disrupt dimer formation and 
to identify unc-5 mutants which are unable to 
dimerize. 



20 



30 
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The present inventors have found that in humans 
UNC-5 proteins may be encoded by at least three genes, 
the homologous genes unc-5C, unc-5HSl, unc-5HS2. As 
UNC-5 is an important receptor involved in a vast 
amount of biological processes, it is considered that 
more functional homologous genes or unc-5 genes may 
present in the Homo sapiens genome. In addition, the 
expression of the unc-5 gene does not result in the 
production of a single transcript. The expression of 
unc-5C locus can result in the production at least 4 
isoforms as a result of alternative splicing events. 
It is possible that the other unc-5 genes will also 
express splice variants, which may encode different 
protein isoforms . Any of these unc-5 isoforms may form 
dimers, analogous to the homo-dimerisation found for 
C. elegans unc-5. Accordingly, assays can also be 
developed to screen for chemical substances that alter 
the dimerization of human unc-5 proteins. Compounds 
identified using such an assay may have 
pharmacologically useful properties . 

(B) Other receptor dimerizations . 

It has been suggested that, in addition to UNC-6, 
UNC-129 also signals to the UNC-5 receptor (Colavita 
et al., Science 261:706-709). UNC-6 is also known to 
signal to UNC-40(DCC). UNC-129 belongs to the TGF-p 
superfamily. TGF-0 receptors, including DAF-1 and 
DAF-4, do not affect axonal guidance. Although new 
TGF-p receptors may be found that are involved in 
axonal guidance, it is more likely that the UNC-129 
molecule is able to interact with TSP type I domains, 
which are present in UNC-5. Such interaction between 
TGF-)3 molecules and TSP Type I domains has been shown 
previously (Schultz-Cherry et.al., 1994, J. Biol. 
Chem. 269, 26775) . Furthermore UNC-129 is also 
involved in the UNC-40 pathway. 

Recent studies have provided support for the idea 
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that the UNC-5 receptor induces switching of UNC-40 
from attraction to repulsion (Mehlen et al., Nature 
395:801-804, 1998) . This suggests a linkage of Uric- 5 
to oncology since Unc-40 is related to vertebrate DCC 
5 (deleted in colorectal cancer) , which is a candidate 
tumour-suppressor gene, and encodes a receptor for 
netrin-1 (UNC-6) . The reversal from attraction towards 
repulsion in growth cone steering with the two 
receptors UNC-5 and UNC-4 0 can be explained by 

10 hetero-dimerization between UNC-5 and UNC-40. Such 

switching of function has also been observed in other 
biological processes. The UNC-40/UNC-5 interaction 
may function analogously to the Bax/Bcl-2 interaction 
involved in apoptosis. Bax can be considered as the 

15 protein that protects against apoptosis but the 

relative titre of both Bax and Bcl-2 in a cell may be 
important in the decision of cell death. 

Given that UNC-5 is capable of forming 
homodimers, it is postulated that UNC-5 is also 

20 capable of forming heterodimers with UNC-4 0. The UNC- 
5/UNC-40 heterodimers may act as a functional receptor 
for UNC-6 and UNC-129. Assays to isolate compounds 
that influence the interaction between UNC-5 and 
UNC-40, both enhancing and inhibiting this interaction 

25 have therefore been developed. These assays are 
analogous to the assays as described to isolate 
compounds that influence the formation of the UNC-5 
dimers and the assays for compounds that influence the 
interaction of UNC-5 with its other interacting 

30 proteins (see below) . 

(C) C. eleaans UNC-5 interacting proteins 

The intracellular part of UNC-5 containing the 
domains MPP, ZO-1 and UP cloned in vector pGC1003 (a 
35 plasmid map of pGC1003 is given in Figure 9 and the 
complete sequence of the plasmid is given in SEQ ID 
NO: 92) was used as *bait' in a yeast two hybrid 
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10 



15 



experiment screening against a C. elegans cDNA 
library. These experiments resulted in the 
identification of ten genes, including three known 
genes and seven genes with heretofore unknown 
function, encoding proteins which specifically 
interact with the intracellular part of UNC-5. 
Details of the UNC-5 interacting proteins identified 
during the two hybrid screen are given below. In most 
cases, the results of double-stranded RNA inhibition 
experiments (RNAi) designed to inhibit expression of 
the interacting protein are also given. Where 
appropriate, details of human homologues of the 
interacting protein are also given and any known 
disease associations are discussed. 



1) Spectrin p-chain / Fodrin B-chain (pC1025) 

A first series of hits resulted in the 
identification of plasmid pC1025 which contains a 
fragment of a cDNA encoding the C. elegans spectrin 

20 p-chain/Fodrin. The spectrin p-chain protein is 

encoded by the gene K11C4.3, located on chromosome IV. 

The full length cDNA and amino acid sequences of 
spectrin p-chain/Fodrin are shown in SEQ ID NOS: 11 
and 12, respectively. The nucleotide sequence of the 

25 fragment of the spectrin p-chain cDNA which is cloned 
as an insert in plasmid pC1025 is given in SEQ ID NO: 
13, the corresponding amino acid sequence is given in 

SEQ ID NO: 14 . 

RNAi experiments using a double-stranded RNA 

30 corresponding to the cDNA fragment cloned in pC1025 
revealed that inhibition of the expression of the 
native spectrin p-chain in C. elegans worms causes the 
following phenotype: no embryonal lethality, normal 
canals, normal elongation, growth retardation and 

35 growth arrest at LI and L2, nearly no movement but 
touch reflex is observed. The phenotype is 100% 
penetrant, and the larvea are short and wrinkled. 
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These RNAi phenotypes and the corresponding knock-out 
phenotype can be used as the basis of a compound 
screen in C . elegans to SHentify chemical substances 
that modulate the activity of the spectrin p-chain 
5 protein. 

Human Fodrin (genbank accession number 2493434) 
contains an extra C-terminal PDZ domain that is not 
present in spectrin (genbank accession number 134798) . 
The human fodrin seems to be more homologous to the C. 
10 elegans protein. This is in agreement with the finding 
that unc-5 is also expressed in the brain of 
vertebrates . 

The interaction between UNC-5 and fodrin could be 

15 a critical event in a cell signalling, hence compounds 
which modulate the interaction between UNC-5 and 
fodrin, particularly the interaction between human 
UNC-5 and human fodrin, may potentially have 
pharmacological activity. Assays can also be developed 

20 to screen for genetic mutations that inhibit the 

interaction needed for proper signal transduction. 
Compounds which enhance or inhibit the interaction of 
UNC-5 with fodrin and spectrin p-chain may be useful 
in the development of pharmaceutical preparations for 

25 the treatment of Crohn's disease, Sjogren's syndrome, 
secretion related diseases, diseases related to 
neutophil and platelet activation, and long-term 
potential in neurons, Alzheimer's disease, 
proliferative diseases such as carcinomas, neoplasia, 

30 and more specifically, shwannomas, meningiomas, 
ependymonas, squamous cell carcinomas, malignant 
melanomas and lung carcinomas, spherocytosis, 
pyropoikilocytosis, Duchenne muscular dystrophy and 
various neurological disorders. 

35 

2) APR-1 (PC1028) 

A second plasmid isolated in the yeast two hybrid 
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screen, pC1028, contained a fragment of a cDNA 
encoding APR-1. 

The nucleotide sequence of the full length APR-1 
cDNA is shown in SEQ ID NO: 15 and the amino acid 
5 sequence of the APR-1 protein encoded by this cDNA is 
shown in SEQ ID NO: 16 . The nucleotide sequence of 
the fragment of the APR-1 cloned in pC1028 is shown in 
SEQ ID NO: 17 , with the corresponding amino acid 
sequence shown in SEQ ID NO: 18. 

10 RNAi experiments using a double-stranded RNA 

corresponding to the fragment cloned in pC1028 
demonstrated that inhibition of APR-1 expression in C. 
elegans results in the following phenotype: more than 
95% embryonic lethality, in 25% of cases this was due 

15 to the overproduction of pharyngeal tissue and lack of 
endoderm, and premature division of the E daughters 
(Rocheleau et al., Cell 90:707-716, 1997). Escapers 
(worms that survive) have abnormal gut cells. These 
RNAi phenotypes and the corresponding knock-out 

20 phenotype can be used as the basis of a compound 

screen in C. elegans to identify chemical entities 
that modulate the activity of APC (see below) , and 
hence the unc-5 pathway. 

25 Further yeast two hybrid experiments were 

performed in order to more precisely determine the 
position of the APR binding regions in UNC5, using the 
UNC5 domains MPP, ZO-1, UP and combinations thereof. 
APR-1 seemed to associate with two distinct regions in 

30 UNC5. First, APR-1 appears to bind to the MPP domain. 
Secondly, APR-1 appears to binding to the ZO-l/UP 
domain. APR-1 seems to bind less to the ZO-1 and UP 
domains when they are present alone and not in 
combination. A similar experiment was carried out 

35 using the C. elegans UNC-5 protein, and domains of 

human APC and analogous results were obtained. It is 
concluded that APC is capable of binding to two 
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distinct regions of UNC-5, the MPP and the ZO-l/UP 
domains . 

w w ^>^w»»w^.j.* w«.-.s-r ^ uuu ni. ni. u v^wuxu 

be a critical event in cellular signalling and hence 
5 compounds which modulate this interact ion , 

particularly compounds which modulate an interaction 
between human UNC-5 and human APC, may potentially 
have pharmacological activity. Furthermore genetic 
mutations, and splice variants can be identified that 

10 inhibit the interaction, needed for proper signal 

transduction. Compounds which enhance or inhibit the 
interaction of UNC-5 with APR/APC may be useful in the 
development of pharmaceutical agents for the treatment 
of neurological diseases and colorectal cancers such 

15 as adenomatous polyposis coli. 

3) UNC-14 (PC1034) 

A third plasmid identified during the yeast two 
hybrid screen using C. elegans UNC-5 as bait (pC1034) 

20 was found to contain a fragment of the UNC-14 cDNA. 

The nucleotide sequence of the full length UNC-14 
cDNA is shown in SEQ ID NO: 19, the amino acid 
sequence of the protein encoded by this cDNA is given 
in SEQ ID NO: 20. The nucleotide sequence of the 

25 fragment of the UNC-14 cDNA cloned as an insert in 
pC1034 is shown in SEQ ID NO: 21, with the 
corresponding amino acid sequence of the polypeptide 
encoded by this fragment shown in SEQ ID NO: 22. 

C. elegans worms mutated in unc-14 are observed 

30 to be very sluggish, almost paralysed, small, 

dumpyish, with a tendency to coil and show some egg 
retention. This phenotype can be used as the basis of 
a compound screen in C. elegans to identify chemical 
entities that modulate the activity of UNC-14. 

35 Furthermore, C. elegans worms mutated in the 

unc-14 gene were shown to have abnormal axonal 
elongation and axonal structures. The unc-14 gene 
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encodes a protein of 665 amino acids, and is 
co-expressed with the unc-51 gene in the cell bodies 
and axons of almost all neurons including DD/VD and 
hermaphrodite-specific neurons. The results of yeast 
5 two-hybrid experiments suggested that a central region 
of UNC-14 binds to the carboxy-terminal region of 
UNC-51, and that the UNC-51 carboxy-terminal region 
oligomerized (Ogura et al., Genes Dev. 11:1801-1811, 
1997) . 

10 Mutations in the unc-51 gene, isolated from 

mutants of Caenorhabditis elegans exhibiting abnormal 
axonal extension and growth, encodes a novel 
serine/threonine kinase (K. Ogura, et al., 1994, Genes 
Dev. 8: 2389- 2400) . 

15 

4) F11A10.1 (PGC1021) 

A fourth plasmid isolated during the yeast two 
hybrid screen, pGC1021, was found to contain a 
fragment of cDNA corresponding to the C. elegans gene 

20 designated F11A10.1. 

The nucleotide sequence of the full length 
F11A10.1 cDNA is shown in SEQ ID NO: 23, with the 
amino acid sequence of the protein encoded by this 
cDNA shown in SEQ ID NO: 24. The nucleotide sequence 

25 of the fragment of the F11A10.1 cDNA cloned in pGC1021 
is shown in SEQ ID NO: 25 , the amino acid sequence of 
the protein fragment encoded by this fragment of the 
cDNA is shown in SEQ ID NO: 26. 

To date, no function is as yet known for 

30 FllAlO^l. RNAi experiments using a double-stranded 

RNA corresponding to the insert of pGC1021 showed that 
inhibition of F11A10.1 expression in C. elegans 
results in worms which are weakly constipated. In C. 
elegans , constipation has been associated with 

35 neuronal dysfunction (Thomas, Genetics 124:855-872, 
1990) . Furthermore and remarkably inhibition of 
F11A10.1 expression causes migration defects in the 
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distal tip cell, similar to those observed in unc-5 
mutants and unc-14/unc-51 double mutants. These RNAi 
phenotypes and the corresponding knock-out phenotypes 
can be used as the basis of a compound screen in C. 
5 elegans to identify chemical entities that modulate 
the activity of F11A10.1. 

The interaction between UNC-5 and F11A10.1 could 
be a critical event in cellular signalling and hence 
compounds which modulate this interaction may 

10 potentially have pharmacological activity. 

Furthermore, genetic mutations, and splice variants 
can be identified that inhibit the interaction, needed 
for proper signal transduction. Compounds which 
enhance or inhibit the interaction of UNC-5 with 

15 F11A10.1 may be of use in the development of 

pharmaceutical compositions useful in the treatment of 
neurological disorders, tumours such as Kaposi's 
Sarcoma, immunological disorders and diseases related 
to vesicle fusion, proteolysis, peroxisomal and 

20 mitochondrial biogenesis, and transcription. 

5) C15E6.1/2 (PGC1026) 

A fifth plasmid identified during the yeast two 
hybrid experiment, pGC1026, was found to contain a 
25 fragment of a cDNA encoding the C15E6.1 protein. 

The nucleotide sequence of the full length 
C15E6.1/2 cDNA is shown in SEQ ID NO: 27, with the 
amino acid sequence of the protein encoded by this 
cDNA shown in SEQ ID NO: 28. The nucleotide sequence 
30 of the fragment of the C15E6.1/2 cDNA cloned in 

pGC1026 is shown in SEQ ID NO: 29, the amino acid 
sequence of the protein fragment encoded by this 
fragment of the cDNA is shown in SEQ ID NO: 30. 

RNAi experiments using a double-stranded RNA 
35 corresponding to the insert of pGC1026 did not result 
in any clear visual phenotype. 

The identification of C15E6.1/2 as an UNC-5 
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interacting protein indicates that UNC-5 might be a 
band 4 . 1 binding protein and may share homology with 
other band 4.1 binding proteins such as CD44, 
glycophrin C, and paranodin. 
5 By using the band 4.1 signature to search a 

database of C.elegans genes, F07A11.1 on chromosome II 
was identified as encoding a band 4.1 protein. 

The interaction between UNC-5 and C15E6.1/2 could 
be a critical event in cellular signalling and hence 

10 compounds which modulate this interaction may 

potentially have pharmacological activity. Furthermore 
genetic mutations, and splice variants can be 
identified that inhibit the interaction, needed for 
proper signal transduction. Compounds which enhance 

15 or inhibit the interaction of UNC-5 with C15E6.1/2 may 
be useful in the development of pharmaceutical 
preparations for the treatment of diseases related to 
axonal signalling, synaptic vesicle exocytosis, cell 
adhesion, cytoskeleton associated proteins, cell 

20 morphology , cell growth, allergic inflammatory 
processes and rheumatoid arthritis. 

6) D1081.7 (PGC1027) 

A sixth plasmid identified during the two hybrid 
screen was found to contain a fragment of cDNA 
corresponding to the C. elegans gene designated 
D1081.7. 

The nucleotide sequence of the full length 
D1081.7 cDNA is shown in SEQ ID NO: 31, the amino acid 
sequence of the protein encoded by this cDNA is given 
in SEQ ID NO: 32. The nucleotide sequence of the 
fragment of the D1081.7 cDNA cloned as an insert in 
pGC1027 is shown in SEQ ID NO: 33, with the 
corresponding amino acid sequence of the polypeptide 
encoded by this fragment shown in SEQ ID NO: 34. 

RNAi experiments performed using double stranded 
RNA corresponding to the insert in pGC1027 appeared 
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not to result in any clear visual phenotype. 

All genes so far found in C. elegans have human 
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also have vertebrate, including human, homologues. 
5 These homologues can be cloned using standard 
technologies . 

The interaction between UNC-5 and D1081.1 could 
be a critical event in cellular signalling and hence 
compounds which modulate this interaction may 
10 potentially have pharmacological activity and thus be 
of use in the development of pharmaceutical 
compositions. Furthermore genetic mutations, and 
splice variants may be identified that inhibit the 
interaction needed for proper signal transduction. 

15 

7) B0238.9 (PGC1032) 

A seventh plasmid identified during the two 
hybrid screen was found to contain a fragment of cDNA 
corresponding to the C. elegans gene designated 

20 B0238.9. 

The nucleotide sequence of the full length 
B0238.9 cDNA is shown in SEQ ID NO: 35, the amino acid 
sequence of the protein encoded by this cDNA is given 
in SEQ ID NO: 36. The nucleotide sequence of the 

25 fragment of the B0238.9 cDNA cloned as an insert in 
pGC1032 is shown in SEQ ID NO: 37, with the 
corresponding amino acid sequence of the polypeptide 
encoded by this fragment shown in SEQ ID NO: 38. 

B0238.9 is located in the chromosomal region 

30 where seu-2 is also located. The seu-2 was identified 
in suppressor screens of ectopically expressed unc-5 
and is considered to be involved in the unc-5 pathway 
(Colavita and Culotti, Dev. Biol. 194:72-85, 1998). As 
a gene has now been isolated that interacts with 

35 unc-5, it is high probable that 30238.9 is the same as 
seu-2. Mutations in seu-2 appeared not to have any 
visual phenotype, as was also observed in RNAi 
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experiments using a double stranded RNA corresponding 
to a fragment of B0238.9. The finding that SEU-2 is a 
suppressor and a binding partner to UNC-5 validates 
the importance of this interaction. Other known 
5 suppressors of ectopic unc-5 growth cone steering are 
unc-6, unc-40, unc-34 , unc-44, unc-129 , seu-2, seu-2 , 
and seu-3. Mutations in some of these genes show 
axonal guidance defects, unlike seu-2. 

Homology searches in the EST database with 
10 B0238.9 revealed the presence of at least two human 
ESTs with significant homology. The ESTs so found, 
nz77b06 and yu53g01, can be used as basis to clone the 
full length cDNA encoding the human homologue of 
B0238.9. 

15 The interaction between UNC-5 and B0238.9 could 

be a critical event in cellular signalling and hence 
compounds which modulate this interaction may 
potentially have pharmacological activity and thus may 
be useful in the development of pharmaceutical 

20 compositions. Furthermore genetic mutations, and 

splice variants may be identified that inhibit the 
interaction, needed for proper signal transduction. 

8) ZC404.8 (PGC1033) 

25 An eighth plasmid identified during the two 

hybrid screen was found to contain a fragment of a 
cDNA corresponding to the C. elegans gene designated 
ZC404 .8. 

The nucleotide sequence of the full length 
30 ZC404.8 cDNA is shown in SEQ ID NO: 39, the amino acid 
sequence of the protein encoded by this cDNA is given 
in SEQ ID NO: 40. The nucleotide sequence of the 
fragment of the ZC404.8 cDNA cloned as an insert in 
pGC1033 is shown in SEQ ID NO:. 41, with the 
35 corresponding amino acid sequence of the polypeptide 
encoded by this fragment shown in SEQ ID NO: 42. 

RNAi experiments using a double stranded RNA 
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corresponding to a fragment of this gene resulted in 
an embryonic lethal phenotype. The worms showed no 
elongation and only very little muscle activity- the 
hypodermis is clearly abnormal. 
5 Homology searches in the EST database with 

ZC404.8.9 revealed the presence of at least three 
human ESTs with significant homology. The ESTs thus 
identified, qe69h03, zx61d04, and zd35el0, can be used 
as basis to clone the full length cDNAs. 

10 The interaction between UNC-5 and ZC404.8 could 

be a critical event in cellular signalling and hence 
compounds which modulate this interaction may 
potentially have pharmacological activity and thus be 
useful in the development of pharmaceutical 

15 preparations. Furthermore genetic mutations, and 

splice variants may be identified that inhibit the 
interaction, needed for proper signal transduction. 

9) vk!7a3 (pGC1023) 

20 A ninth plasmid identified during the yeast two 

hybrid experiment was found to contain a fragment of 
cDNA corresponding to the C. elegans gene designated 
ykl7a3. 

The nucleotide sequence of the fragment of the 
25 ykl7a3 cDNA cloned as an insert in pGC1023 is shown in 
SEQ ID NO: 43, with the corresponding amino acid 
sequence of the polypeptide encoded by this fragment 
shown in SEQ ID NO: 44. 

RNAi experiments using a double stranded RNA 
30 corresponding to a fragment of ykl7a3 resulted in the 
following phenotypes in C. elegans: Very slow growth, 
and the larvae get typical darker spots as they get 
older. Inhibition of ykl7a3 expression in some non 
wild-type genetic backgrounds leads to defective 
35 moulting, where the worm cannot escape from the old 
cuticle and therefore shrinks and stays in the L4 
stage. The defective moulting phenotype is also 
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observed when ykl7a3 expression is inhibited on a 
wild-type genetic background, although the phenotype 
is less prominent. Worms which escape the defective 
moulting phenotype show defects in vulva development, 
either lacking a vulva altogether or having a vulva 
which is non-functional. 

Homology searches in the Genbank database with 
ykl7a3 revealed the presence of at least one human 
homologue of this gene, designated KIAA0187. 

The interaction between UNC-5 and ykl7a3 
(KIAA0187) could be a critical event in cellular 
signalling and hence compounds which modulate this 
interaction may potentially have pharmacological 
activity. Furthermore genetic mutations, and splice 
variants may be identified that inhibit the 
interaction, needed for proper signal transduction. 
Compounds which enhance or inhibit the interaction of 
UNC-5 with ykl7a3 may be of use in the development of 
pharmaceutical compositions for the treatment of 
CADASIL, artheriohepatic dysplasia, Alzheimer's 
disease, neoplasia such as T-cell acute lymphoblastic 
leukemia and certain cancers, such as pancreatic 
cancer and colon cancer. 

25 10) F41H10.3 (pGC1020) 

A tenth plasmid identified using the yeast two 
hybrid experiment was found to contain a fragment of a 
cDNA corresponding to the C. elegans gene designated 
F41H10.3. 

30 The nucleotide sequence of the full length 

F41H10.3 cDNA is shown in SEQ ID NO: 45, the amino 
acid sequence of the protein encoded by this cDNA is 
given in SEQ ID NO: 46. The nucleotide sequence of 
the fragment of the F41H10.3 cDNA cloned as an insert 

35 in pGC1020 is shown in SEQ ID NO: 47, with the 

corresponding amino acid sequence of the polypeptide 
encoded by this fragment shown in SEQ ID NO: 48. 
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F41H10.3 harbors a ATP/GTP binding domain- 
Worms resulting from RNAi experiments using a 
double stranded RNA corresponding to a fragment of 
F41H10.3 did not exhibit a clear visual phenotype. 
5 All genes so far found in C. elegans have human 

homologues. It is therefore expected that F41H10.3 
v will also have vertebrate, including human, 

homologues. These homologues can be cloned using 
standard technologies well known to persons skilled in 
10 the art. 

The interaction between UNC-5 and F41H10.3 could 
be a critical event in signalling and compounds which 
modulate this interaction may potentially have 
pharmacological activity and thus be useful in the 
15 development of pharmaceutical preparations. 

Furthermore genetic mutations, and splice variants may 
be identified that inhibit the interaction, needed for 
proper signal transduction. 

20 (D) Human UNC-5 interacting proteins. 

The intracellular part of the human UNC-5 protein 
(human UNC-5HS1) containing the domains ZO-1, UP and 
DD cloned in vector pGC1037 (see above) was used as 
*bait' in a yeast two hybrid experiment screening 

25 against a pretransf ormed human brain Matchmaker cDNA 
library (Clontech, Palo Alto, California USA) using 
the mating screen approach described above. These 
experiments resulted in the identification of six 
genes encoding proteins which interact with UNC-5, 

30 including two known genes and four heretofore unknown 
genes . 

All proteins found in this yeast two hybrid 
screen with the human UNC-5 were different to the 
proteins found in the screen with the C. elegans 
35 UNC-5 . There are at least two reasons for this 

variation in the isolated proteins. First, the screens 
are not saturated, which means that not all possible 
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interacting proteins have been isolated, neither in 
the screen with the C. elegans UNC-5 nor in the screen 
with the human UNC-5. Secondly, different 
intracellular fragments have been used in the screens. 
5 In the C. elegans UNC-5 screen, the intracellular 

domains MPP, ZO-1 and UP were used as bait, whereas in 
the human UNC-5 screen, the intracellular domains 
ZO-1, UP and DD were used as bait. Proteins with 
specific interaction patterns will not be isolated if 

10 the necessary interacting domain is missing, or if the 
optimal combination of domains is missing. This has 
been shown in the C. elegans UNC-5 interaction with 
APR. APR interacts clearly with the MPP domain and the 
domain combination ZO-1, UP, but interacts less 

15 efficiently to with domain combination MPP, ZO-1, 
although the MPP domain is present. APR binds 
efficiently to the domain combination MPP, ZO-1, UP. 

The human UNC-5 interacting proteins identified 
during the two hybrid screen are listed below. In 

20 each case, any known disease associations are 

discussed and genes/cDNAs encoding homologous C. 
elegans proteins are listed. 

1) i-beta-1 , 3-N-acetylaminyltansf erase (pYMPS) . 

25 A first plasmid identified during the yeast two 

hybrid experiment was found to contain a fragment of 
the cDNA encoding i-beta-1 , 3-N-acetylaminyltansf erase . 

The nucleotide sequence of the full length 
i-beta-1, 3-N-acetylaminyltansf erase cDNA is shown in 

30 SEQ ID NO: 49, the amino acid sequence of the protein 
encoded by this cDNA is given in SEQ ID NO: 50. The 
partial nucleotide sequences of the fragment of the 
i-beta-1, 3-N-acetylaminyltansf erase cDNA cloned as an 
insert in pYMPS are shown in SEQ ID NOs: 51 and 52, 

35 with the corresponding amino acid sequence of the 

polypeptide encoded by these partial sequences shown 
in SEQ ID NO: 53. 
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C. elegans has at least seven putative homologues 
of i-beta-1, 3-N-acetylaminyltansf erase, designated 
F22F7.6, C18G1.3. K09C8.4, F21H7.10, C54C8.2. F56H6.6 
and T15D6.4. cDNA and/or amino acid sequences for 
5 each of these putative homologues are given herein. 
Amino acid and nucleotide sequences for these 
homologues are given in SEQ ID NOS: 66 to 82. 

The interaction between UNC-5 and 
beta-l f 3~N-acetylglucosaminyltransf erase could be a 

10 critical event in signalling and hence compounds which 
modulate this interaction may potentially have 
pharmacological activity. Furthermore genetic 
mutations, and splice variants may be identified that 
inhibit the interaction, needed for proper signal 

15 transduction. Compounds which modulate the interaction 
of UNC-5 with beta-1, 3-N-acetylglucosaminyltransf erase 
may be useful in the development of pharmaceutical 
preparations for the treatment of synaptic cleft 
dysfunctions, vesicle transport dysfunctions, 

20 inflamation, various tumours and more particular in 

tumour cell adhesion, migration and invasion, such as 
pancreas cancer, squamous cell cancer, human breast 
cancer, thyroid neoplasms, colorectal carcinomas. 

25 2) new gene with slight homology to neurexin 
II- alpha -b (NHII) (r>YMP6) 

A second plasmid identified during the yeast two 
hybrid experiment was found to contain a fragment of a 
cDNA corresponding to a new gene with slight homology 
30 to neurexin II-alpha-b. The new gene was designated 
NHII. 

Partial nucleotide sequences for the fragment of 
cDNA cloned as an insert in pYMP6 are shown in SEQ ID 
NO: 54 (coding strand sequenced from one end of the 
35 insert of pYMP6 sequenced with forward primer) and and 
SEQ ID NO: 55 (non-coding strand sequenced from one 
. end of pYMP6 with reverse primer) . The plasmid pYMP6 
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was deposited in the Belgian Co-ordinated Collections 
of Microorganisms (BCCM) , Universiteit Gent, K. L. 
Ledeganckstraat 35, B-9000, Gent, Belgium on 21 May 
1999 under accession number LMBP 3932. The cDNA 
5 insert (approximately 1800bp) can easily be excised 
from this plasmid by digestion with the restriction 
enzymes EcoRI and Xhol . Alternatively the cDNA insert 
sequence can be amplified by PCR using primers 
corresponding to the sequences for the ends of the 

10 insert given in SEQ ID NOS : 54 and 55. 

The interaction between UNC-5 and the new gene 
with homology to neurexin II-alpha-b could be a 
critical event in signalling and hence compounds which 
modulate this interaction may potentially have 

15 pharmacological activity. 

3) New Gene with Mena homology (MHI) (pYMP17) 

A third plasmid identified during the yeast two 
hybrid experiment was found to contain a fragment of a 
20 cDNA encoding a protein sharing slight homology with 
the human mena protein. The new gene was designated 
MKI. 

Partial nucleotide sequences of the fragment of 
cDNA cloned as an insert in pYMP17 are shown in SEQ ID 

25 NO: 56, (coding strand sequenced from one end of the 
insert of pYMP sequenced with forward primer) and SEQ 
ID NO: 57 (non-coding strand sequenced from one end of 
pYMP with reverse primer) . An alignment between the 
amino acid sequence encoded by the insert of pYMP17 

30 and the mouse mena protein is shown in Figure 7. The 
plasmid pYMP17 was deposited in the Belgian Co- 
ordinated Collections of Microorganisms (BCCM) , 
Universiteit Gent, K. L. Ledeganckstraat 35, B-9000, 
Gent, Belgium on 21 May 1999 under accession number 

35 LMBP 3935. The cDNA insert (approximately lOOObp) can 
easily be excised from this plasmid by digestion with 
the restriction enzymes EcoRI and Xhol. Alternatively 
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the cDNA insert sequence can be amplified by PCR using 
primers corresponding to the sequences for the ends of 
the insert given in Fiqures 55A and 55B. 

C. elegans has at least one protein with homology 
5 to the new Mena homologue (MHI), encoded by the gene 
designated Y50D4 . Contig200 . The C. elegans gene, 
unc-34 (which maps with Y50D4) is known to suppress 
the axonal guidance defects induced by ectopic 
expression of the Netrin receptor UNC-5 (Colavita, A. 

10 et al., Dev. Biol., 194:72-85, 1998.). 

The interaction between UNC-5 and mena, members 
of this mena superfamily, unc-34, and Y50D4 . contig200, 
could be a critical event in signalling and hence 
compounds which modulate these interactions may 

15 potentially have pharmacological activity and thus may 
be useful in the development of pharmaceutical 
compositions . 

4) Alpha-2 macroglobulin (pYMP30) 

20 A fourth plasmid identified during the yeast two 

hybrid experiment was found to contain a fragment of 
the human alpha-2 macroglobulin cDNA. 

The nucleotide sequence of the full length alpha- 
2 macroglobulin cDNA is shown in SEQ ID NO: 58, the 

25 amino acid sequence of the protein encoded by this 

cDNA is given in SEQ ID NO: 59. A partial nucleotide 
sequence for the fragment of the alpha-2 macroglobulin 
cDNA cloned as an insert in pYMP30 is shown in SEQ ID 
NO: 60. 

30 C. elegans has at least one homologue of alpha-2 

macroglobulin, designated ZK337.1, of which two splice 
variants designated ZK337.1a and ZK337.1b are known to 
exist . 

The interaction between UNC-5 and alpha-2 
35 macroglobulin could be a critical event in signalling 
and hence compounds which modulate this interaction 
may potentially have pharmacological activity. 
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Compounds which enhance or inhibit the interaction of 
UNC-5 with alpha-2 macroglobulin could be useful in 
the development of pharmaceutical substances. 

5 5) New gene 1 (pYMPll) 

A fifth plasmid identified during the yeast two 
hybrid experiment was found to contain a fragment of 
cDNA with no homology to any known human cDNA. 

Partial nucleotide sequences for the fragment of 

10 cDNA cloned as an insert in pYMPll are shown in SEQ ID 
NO: 61 (coding strand sequenced from one end of the 
insert of pYMP sequenced with forward primer) and SEQ 
ID- NO: 62 (non-coding strand sequenced from one end of 
pYMP with reverse primer) . The plasmid pYMPll was 

15 deposited in the Belgian Co-ordinated Collections of 
Microorganisms (BCCM) , Universiteit Gent, K. L. 
Ledeganckstraat 35, B-9000, Gent, Belgium on 21 May 
1999 under accession number LMBP 3933* The cDNA 
insert (approximately 2300bp) can easily be excised 

20 from this plasmid by digestion with the restriction 

enzymes EcoRI and Xhol . Alternatively the cDNA insert 
sequence can be amplified by PCR using primers 
corresponding to the sequences for the ends of the 
insert given in Figures 59A and 59B* 

25 The interaction between UNC-5 and the protein 

encoded by the insert of pYMPll could be a critical 
event in signalling and hence compounds which modulate 
this interaction may potentially have pharmacological 
activity and thus could be useful in the development 

30 of pharmaceutical substances. 

6) New gene 2 (pYMP12) 

A sixth plasmid identified during the yeast two 
35 hybrid experiment was found to contain a fragment of 
cDNA with no homology to any known human cDNA. 

Partial nucleotide sequences for the fragment of 
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cDNA cloned as an insert in pYMP12 are shown in SEQ ID 
NO: 63 (coding strand sequenced from one end of the 
insert of pYMP sequenced with forward primer) and SEQ 
ID NO: 64 (non-coding strand sequenced from one end of 
5 pYMP with reverse primer) . The plasmid pYMP12 was 

deposited in the Belgian Co-ordinated Collections of 
Microorganisms (BCCM) , Universiteit Gent, K. L. 
Ledeganckstraat 35, B-9000, Gent, Belgium on 21 May 
1999 under accession number LMBP 3934 . The cDNA 

10 insert (approximately 2000bp) can easily be excised 
from this plasmid by digestion with the restriction 
enzymes EcoRI and Xhol . Alternatively the cDNA insert 
sequence can be amplified by PCR using primers 
corresponding to the sequences for the ends of the 

15 insert given in Figures 60A and 60B. 

The interaction between UNC-5 and the protein 
encoded by the insert of pYMP12 could be a critical 
event in signalling and hence compounds which modulate 
this interaction may potentially have pharmacological 

20 activity and thus could be useful in the development 
of pharmaceutical substances. 

Example 5 

Yeast two hybrid compound screens 

25 Interactions of proteins leads to expression of a 

reporter protein p-galactosidase in a yeast two hybrid 
assay. An assay has been developed that is usable in 
96 or 384 well plates or microtiter plates with 
another number of wells. This assay is suitable for 

30 high throughput compound screening. Optimal 

performance of the assay is dependent upon at least 
two important parameters: 

lysis of yeast cells and the choice of the 
(J-galactosidase substrate . 
35 The basic protocol for an assay in 96 or 384 well 

plates is as follows: 

A yeast strain containing the Escherichia coli 
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lacZ gene under the control of the yeast Gal4 promoter 
is grown overnight (with shaking at 230-270 rpm) then 
diluted with YPD medium to an OD600 of 0.2. Diluted 
cultures are grown for an additional 3-5 hr until 
5 mid-log phase. Yeast cells are then transferred to 
either 96- or 384-well plates (100 jil/well or 25 
pl/well, respectively) . Alternatively , cells can be 
cultured in the microtiter plates, eliminating the 
need for a pipetting step. 

10 The yeast cells are then either lysed by freeze 

and thaw method (liquid N 2 to freeze, 37°C water bath 
to thaw) or by use of a Lysis buffer (e.g.: 1% Lithium 
dodecyl sulphate, 100 mM EDTA and 10 mM Tris-HCl pH 
8.0). Non-lysed cells also give a signal, although the 

15 variability is increased if the cells are not lysed. 
Yeast cells can also be permeabilized with various 
reagents such as isopropanol (15 %). 

The substrate sensitivity must be optimised for 
efficient detection in a screening process. 

20 Fluorescein di galactoside (FDG) is a typical low cost 
fluorescent reagent for the detection of 
(3-galactosidase; it can be used for screening, 
although autof luorescent compounds can induce a non- 
desirable background leading to false positives. 

25 Alternative substrates are available that become 

luminescent upon (3-galactosidase cleavage, thereby 
eliminating background problems. An example of such a 
substrate Galacton-Star® from Tropix. Typically about 
IpM substrate is added and the plates are incubated at 

30 room temperature for 60 minutes. Fluorescence (for 
FDG) is then measured at 530 nm. It is typically 
possible to detect as low as 100 cells per well. 

As an alternative to the use of (5-galactosidase, 
secreted alkaline phosphatase can be used as a 

35 reporter gene. The use of secreted alkaline 

phosphatase gives equivalent sensitivity to p- 
galactosidase with the advantage that there is no need 
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10 



15 



20 



25 



30 



to lyse the cells. Fluorescent substrates for alkaline 
phosphatase are available commercially from Sigma- 
Aldrich (Bornem, Belqium) or Molecular Probes (Eugene, 
OR, USA) . 

The test compound can be added at various stages 
of the above procedure. Generally, the compound is 
added on the plates onto which the yeast are plated. 
However, the compound can also be added during the 
second incubation in order to overcome toxicity 
ptoblems. As a control, it is important to check 
whether the compound slows down the growth of the 
yeast. This can be done using turbidity measurements. 



Example 6 



Detection of in vivo protein-protein interactions 
using fluorescence energy transfer (FRET) . 

An in vivo FRET assay can be conveniently 
performed using two different mutants of GFP which 
absorb and emit light at different wavelengths and 
which have suitably overlapping emission/absorption 
spectra, such as EGFP (enhanced green fluorescent 
protein) and EBFP (enhanced blue fluorescent protein) . 
When two such variant GFPs are brought into close 
proximity, within a few nanometers distance, 
fluorescence energy transfer (FRET) can be detected. 
Such transfer is characterized by a reduction of 
fluorescence intensity of the donor fluorophore (EBFP) 
and re-emission of fluorescence at the acceptor 
fluorophore (EGFP) wavelengths. Therefore if each 
fluorophore is fused to a protein domain known to bind 
to the other the protein-protein interaction can be 
monitored in vivo using FRET. 

In a typical example, the APC binding domain of 
UNC-5 cloned in fusion with EBFP, whereas APC is 
cloned in fusion with EGFP in expression vectors 
suitable for use in the chosen host cell line or 
organism. When both fusion proteins are expressed in 
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a cell line or in C. elegans it is possible to monitor 
and quantify their in vivo interaction by irradiating 
the cells/worms with light at 488nm. When the donor 
and acceptor fluorophore are brought into close 
5 proximity by binding of the two fusion proteins 

fluorescent energy transfer results in a measurable 
decreased in fluorescence from the fluorescence donor 
at a wavelength within the emission spectrum of the 
donor. In simple terms, what is measured is a 

10 quenching phenomenon since light emitted by the donor 
fluorophore is trapped by the acceptor fluorophore. 
NB- The experiment could also be performed by 
measuring fluorescence from the acceptor fluorophore 
but this is often less sensitive. 

15 Plasmid vectors containing both EGFP and EBFP are 

commercially available from Clontech (Palo Alto, 
California, USA) . Information on the use of these 
vectors is also supplied by the manufacturer. 

20 

Example 7 

Genetic and complementation screens in yeast. 

UNC-5 expression in yeast cells results in a 
lethal phenotype, mainly because of the expression of 

25 a death domain. This observation was most clearly seen 
in the experiments with C. elegans UNC-5. Accordingly, 
assays can be developed to screen for compounds, 
interacting proteins and suppressors which alter the 
activity of UNC-5, particularly the activity of the 

30 death domain of UNC-5 . These assays are analogous to 
those described by Xu and Reed (Mol. Cell 1998, 1:337- 
46) . 

(A) Compound screens. 
35 Yeast cells are transfected with a plasmid 

encoding the C. elegans or human unc-5 (including the 
death domain)., such as the vectors described in the 

i 
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yeast two hybrid experiments. The transfected yeast 
cells are then placed in the wells of rnicotiter 
plates.- <=*r»H *rf> exposed to the compounds under test. 
Compounds which reduce or inhibit the lethal phenotype 
5 of the yeast cells transfected with unc-5 are scored 
as hits. Such compounds will typically suppress the 
unc-5 lethal phenotype by interacting with UNC-5 
itself, or with UNC-5 interacting proteins, or with 
proteins in the UNC-5 pathway, or with proteins in 
10 parallel pathways. The selected compounds can be used 
in the development of pharmaceutical preparations. 

(B) Suppressor screens. 

Yeast cells are transfected with a plasmid 

15 encoding the C. elegans or human unc-5 (including the 
death domain) , such as the vectors described in the 
yeast two hybrid experiments. Furthermore, the yeast 
cells are transfected with a library expressing C. 
elegans or human cDNA, such as the libraries described 

20 in the yeast two hybrid experiments. The transfected 
yeast cells are placed in the wells of rnicotiter 
plates, and allowed to grow further. This allows 
selection cDNAs, and hence genes and proteins, that 
reduce or inhibit the lethal phenotype of the yeast 

25 cells transfected with the death domain of unc-5. 

Such proteins will interact with UNC-5, or with UNC-5 
interacting proteins, or with proteins in the UNC-5 
pathway, or with proteins in parallel pathways to 
cause suppression of the unc-5 lethal phenotype. 

30 The selected cDNAs genes and proteins can be used in 
the development of pharmaceutical preparations or in 
the development of assays to select for compounds that 
enhance their function or expression. 

35 

Example 8 

Cloning of a C. elegans gene startin g from a C. 
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eleaans insert. 

If a fragment of a given gene or cDNA is known 
then further fragments of the corresponding full 
length gene and/or cDNA can be constructed can often 
5 be found using in silico techniques such as AceDB (see 
http:Wwww.sanger.ac.uk), or searching of the EST 
database. The full cDNA can be cloned using standard 
technology such as 5'/3' RACE or SL1/2 RT-PCR on worm 
total RNA and colony hybridization. An analogous 
10 strategy is followed to clone a full length gene 
and/or cDNA for vertebrate and hence Human DNA. 

Example 9 

Cloning of C. eleaans gene starting from a human 

15 insert . 

A full length C. elegans gene can be cloned 
starting from a human sequence. Using in silico 
techniques, a homologue or an EST can be found. 
Standard molecular biology techniques can then be used 

20 to clone the full length C. elegans gene. If no 

homologous sequence can be found by simple database 
searching, it may be necessary to perform species 
hopping. An analogous strategy is followed to clone a 
full length gene and/or cDNA for vertebrate and hence 

25 Human DNA, starting from a C. elegans DNA sequence. 
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SEQUENCE LISTING 

SEQ ID NO: 1 nucleotide sequence of a part of the 
human unc-5Cb cDNA which encodes the 
5 intracellular region of the protein. 

SEQ ID NO: 2 amino acid sequence of the 

intracellular part of the human unc-5Cb 
protein encoded by the nucleotide 
10 sequence shown as SEQ ID NO: 1. 

SEQ ID NO: 3 nucleotide sequence of a part of the 
human unc-5Cc cDNA which encodes the 
intracellular region of the protein. 

15 

SEQ ID NO: 4 amino acid sequence of the 

intracellular part of the human unc-5Cc 
protein encoded by the nucleotide 
sequence shown as SEQ ID NO: 3. 

20 

SEQ ID NO: 5 nucleotide sequence of a part of the 
human unc-5C8 cDNA which encodes the 
intracellular region of the protein. 

25 SEQ ID NO: 6 amino acid sequence of the 

intracellular part of the human unc-5C8 
protein encoded by the nucleotide 
sequence shown as SEQ ID NO: 5. 

30 SEQ ID NO: 7 nucleotide sequence of the fragment of 

the human unc-5Hl cDNA cloned by PCR in 
Example 2. 

SEQ ID NO: 8 predicted amino acid sequence for the 
35 human unc-5Hl protein, translation in 

frame 1 . 
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SEQ ID NO: 9 predicted amino acid sequence for the 
human unc-5Hl protein, translation in 
frame 2 . 

5 SEQ ID NO: 10 predicted amino acid sequence for the 

human unc-5Hl protein, translation in 
frame 3. 

SEQ ID NO: 11 nucleotide sequence of the C. elegans 
10 spectrin (J-chain/Fodrin cDNA. 

SEQ ID NO: 12 amino acid sequence of the C. elegans 
spectrin p-chain/Fodr in protein. 

15 SEQ ID NO: 13 nucleotide sequence of the fragment of 

the C. elegans spectrin (3-chain/Fodrin 
cDNA cloned in pC1025. 

SEQ ID NO: 14 amino acid sequence of the polypeptide 
20 encoded by the cDNA fragment shown as 

SEQ ID NO: 13. 

SEQ ID NO: 15 nucleotide sequence of the C. elegans 
APR-1 cDNA. 

25 

SEQ ID NO: 16 amino acid sequence of the C. elegans 
APR-1 protein. 

SEQ ID NO: 17 nucleotide sequence of a fragment of 
30 the C. elegans APR-1 cDNA cloned in 

pC1028. 

SEQ ID NO: 18 amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
35 SEQ ID NO: 17. 

SEQ ID NO: 19 nucleotide sequence of the C. elegans 
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unc-14 cDNA. 

amino acid sequence of the C. elegans 
unc-14 protein. 

nucleotide sequence of the fragment of 
the C. elegans unc-14 cDNA cloned in 
pC1034. 

amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
SEQ ID NO: 21. 

nucleotide sequence of the C. elegans 
F11A10.1 cDNA. 

amino acid sequence of the C. elegans 
F11A10.1 protein. 

nucleotide sequence of the fragment of 
the C. elegans F11A10.1 cDNA cloned in 
pGC1021. 



WO 00/73328 

SEQ ID NO: 90 

5 

SEQ ID NO: 21 

10 SEQ ID NO: 22 

SEQ ID NO: 23 

15 

SEQ ID NO: 24 

20 SEQ ID NO: 25 

SEQ ID NO: 26 

25 

SEQ ID NO: 27 

30 

SEQ ID NO: 28 
SEQ ID NO: 29 

35 



amino acid sequence 
encoded by the cDNA 
SEQ ID NO: 25. 



of the polypeptide 
fragment shown as 



nucleotide sequence 
C15E6.1 cDNA. 

amino acid sequence 
C15E6.1 protein. 

nucleotide sequence 
the C. elegans C15E6 
pGC1026. 



of the C. elegans 

of the C. elegans 

of the fragment of 
1 cDNA cloned in 
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SEQ ID NO: 30 amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
SEQ ID NO: 29. 

5 SEQ ID NO: 31 nucleotide sequence of the C. elegans 

D1081.7 cDNA. 



SEQ ID NO: 32 amino acid sequence of the C. elegans 

D1081.7 protein. 

10 

SEQ ID NO: 33 nucleotide sequence of the fragment of 
the C. elegans 1081.7 cDNA cloned in 
pGC1027 . 

15 SEQ ID NO: 34 amino acid sequence of the polypeptide 

encoded by the cDNA fragment shown as 
SEQ ID NO: 33. 

SEQ ID NO: 35 nucleotide sequence of the C. elegans 
20 B0238.9 cDNA (seu-2) . 

SEQ ID NO: 36 amino acid sequence of the C. elegans 
B0238.9 protein (seu-2). 

25 SEQ ID NO: 37 nucleotide sequence of the fragment of 

the C. elegans B0238.9 cDNA cloned in 
pGC1023. 

SEQ ID NO: 38 amino acid sequence of the polypeptide 
30 encoded by the cDNA fragment shown as 

SEQ ID NO: 37. 

SEQ ID NO: 39 nucleotide sequence of the C. elegans 
ZC404.8 cDNA. 



35 



SEQ ID NO: 40 amino acid sequence of the C. elegans 
ZC404.8 protein. 
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SEQ ID NO: 42 

5 

SEQ ID NO: 43 

10 

SEQ ID NO: 44 

15 

SEQ ID NO: 45 
SEQ ID NO: 46 

20 

SEQ ID NO: 47 

25 

SEQ ID NO: 48 



30 SEQ ID NO: 49 



SEQ ID NO: 50 

35 
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nucleotide sequence of the C. elegans 
ZC404.8 cDNA cloned in pGC1033. 

amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
SEQ ID NO: 41. 

nucleotide sequence of the fragment of 
the C. elegans ykl7a3 cDNA cloned in 
pGC1023. 

amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
SEQ ID NO: 43. 

nucleotide sequence of the C. elegans 
F41H10.3 cDNA. 

amino acid sequence of the C. elegans 
F41H10.3 protein. 

nucleotide sequence of the fragment of 
the C. elegans F41H10.3 cDNA cloned in 
pGC1020. 

amino acid sequence of the polypeptide 
encoded by the cDNA fragment shown as 
SEQ ID NO: 47. 

nucleotide sequence of the human i- 
beta-1, 3-N-acetyiaminyltransf erase 
cDNA. 

amino acid sequence of the human i- 
beta-1, 3-N-acetyiaminyltransf erase 
protein . 



BNSOOC1D: <WO 0073328A2 I > 



WO 00/73328 PCI7EP00/05108 

- 62 - 

SEQ ID NO: 51 partial nucleotide sequence for the 
fragment of the human i-beta-1 , 3-N- 
acetylaminyltransf erase cDNA cloned in 
pYMP5 (forward primer, coding strand) . 

5 

SEQ ID NO: 52 partial nucleotide sequence for the 
fragment of the human i-beta-1 , 3-N- 
acetylaminyltransf erase cDNA cloned in 
pYMPS (reverse primer, non-coding 
10 strand) 

SEQ ID NO: 53 partial amino acid sequence for the 

polypeptide encoded by the fragment of 
the i-beta-1, 3-N- 
15 acetylaminyltransf erase cDNA cloned in 

pYMPS. 

SEQ ID NO: 54 partial nucleotide sequence for the 
human cDNA fragment cloned in pYMP6 
20 (forward primer, coding strand) . 

SEQ ID NO: 55 partial nucleotide sequence for the 
human cDNA fragment cloned in pYMP6 
(reverse primer, non-coding strand) . 

25 

SEQ ID NO: 56 partial nucleotide sequence for the 
human cDNA fragment cloned in pYMP17 
(forward primer, coding strand) . 

30 SEQ ID NO: 57 partial nucleotide sequence for the 

human cDNA fragment cloned in pYMP17 
(reverse primer, non-coding strand) . 

SEQ ID NO: 58 nucleotide sequence of the human alpha- 
35 2-macroglobulin cDNA. 

SEQ ID NO: 59 amino acid sequence of the human alpha- 
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5 

SEQ ID NO: 61 

10 

SEQ ID NO: 62 

15 

SEQ ID NO: 63 
20 SEQ ID NO: 64 
SEQ ID NO: 65 

25 

SEQ ID NO: 66 

30 

SEQ ID NO: 67 
35 SEQ ID NO: 68 
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2-macroglobulin protein. 

partial nucleotide sequence for the 
fragment of the human alpha-2- 
macroglobulin cDNA cloned in pYMP30 
(reverse primer, non-coding strand) . 

partial nucleotide sequence of the 
fragment of human cDNA cloned in pYMPll 
(forward primer, coding strand) . 

partial nucleotide sequence of the 
fragment of human cDNA cloned in pYMPll 
(reverse primer, non-coding strand) . 

partial nucleotide sequence of the 
fragment of human cDNA cloned in pYMP12 
(forward primer, coding strand) . 

partial nucleotide sequence of the 
fragment of human cDNA cloned in pYMP12 
(reverse primer, non-coding strand) . 

amino acid sequence of the mouse APC-2 
cDNA. 

nucleotide sequence of a C. elegans I- 
beta-1, 3-N-acetylaminyltransf erase cDNA 
(F22F7. 6) . 

amino acid sequence of a C. elegans I- 
beta-1, 3-N-acetylaminyl transferase 
protein (F22F7. 6) . 

nucleotide sequence of the C. elegans 
alpha-2-macroglobulin cDNA ZK337.1a. 
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SEQ ID NO: 70 

5 

SEQ ID NO: 71 

10 SEQ ID NO: 72 

SEQ ID NO: 73 

15 

SEQ ID NO: 74 

20 

SEQ ID NO: 75 

25 

SEQ ID NO: 76 



30 SEQ ID NO: 77 



SEQ ID NO: 78 

35 
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nucleotide sequence of the C. elegans 
alpha-2-macroglobulin cDNA ZK337 . lb 

amino acid sequence of the C. elegans 
alpha-2-macroglobuIin protein ZK337.1a. 

amino acid sequence of the C. elegans 
alpha-2-macroglobulin protein ZK337 . lb . 

cDNA sequence for the C. elegans I- 
beta-1 , 3-N-acetylaminyltransf erase 
homologue C18C1 . 3 . 

amino acid sequence for the C. elegans 
I -bet a- 1 , 3-N-acetylaminyltransf erase 
homologue C18C1 . 3 . 

cDNA sequence for the C. elegans I- 
beta-1, 3 -N-acetylaminyl transferase 
homologue K09C8.4. 

amino acid sequence for the C. elegans 
I~beta-1 , 3-N-acetylaminyltransf erase 
homologue K09C8.4. 

amino acid sequence for the C\ elegans 
I -beta- 1 , 3-N-acetylaminyltransf erase 
homologue F21H7.10. 

cDNA sequence for the C. elegans IE- 
beta- 1, 3-N-acetylaminyltransf erase 
homologue C54C8 . 2 . 

amino acid sequence for the C. elegans 
I-beta-1, 3-N-acetylaminyltransf erase 
homologue C54C8.2. 
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SEQ ID NO: 79 



5 SEQ ID NO: 80 



SEQ ID NO: 81 

10 

SEQ ID NO: 82 

15 

SEQ ID NO: 83 

20 

SEQ ID NO: 84 



25 SEQ ID NO: 85 



SEQ ID NO: 86 

30 

SEQ ID NO: 87 

35 

SEQ ID NO: 88 
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cDNA sequence for the C. elegans I- 
beta-1, 3-N-acetylaminyltransf erase 
homologue F56H6 . 6 . 

amino acid sequence for the C. elegans 
I -beta- 1, 3-N-acetylaminyltransf erase 
homologue F56H6 . 6 . 

cDNA sequence for the C. elegans I- 
beta-1, 3-N-acetylaminyltransf erase 
homologue T15D6 . 4 . 

amino acid sequence for the C. elegans 
I-beta-1 , 3-N-acetylaminyltransf erase 
homologue T15D6.4. 

amino acid sequence of the 
extracellular part of the C. elegans 
unc-5 protein. 

amino acid sequence of the 
transmembrane region of the C. elegans 
unc-5 protein. 

amino acid sequence of the membrane 
proximal part of the C. elegans unc-5 
protein. 

amino acid sequence of the zonula 
occludens part of the C. elegans unc-5 
protein. 

amino acid sequence of a part of the C. 
elegans unc-5 protein of unknown 
function. 

amino acid sequence of the death domain 
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of the C. elegans unc-5 protein. 

SEQ ID NO: 89 amino acid sequence of the human HS1 
protein . 

5 

SEQ ID NO: 90 amino acid sequence of the human UNC5C 
protein . 

SEQ ID NO: 91 complete nucleotide sequence of plasmid 
10 pGC1037. 

SEQ ID NO: 92 complete nucleotide sequence of plasmid 
pGC1003. 

15 SEQ ID NO: 93 amino acid sequence of C. elegans unc- 

40. 

SEQ ID NO: 94 nucleotide sequence of C. elegans unc- 
40. 



20 



25 



SEQ ID NO: 95 amino acid sequence of human unc-4 0. 
SEQ ID NO: 96 nucleotide sequence of human un'c-4 0. 



ACCESSION NUMBERS: 

Human beta-fodrin cDNA-GenBank S65762 
30 Human beta-fodrin protein-swissprot Q01082 
Human APC-1 cDNA-GenBank M74088 
Human APC-1 protein-swissprot P25054 
Human unc-14 cDNA (KIAA0375) -GenBank AB002373 



35 
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Human unc-14 protein (KIAA0375) -BAA20830 
Human ykl7a3 cDNA (KIAA0187 ) -GenBank D80O09 
5 Human ykl7a3 protein (KIAA0187) -SPTREMBL: Q14692 



TABLE 1 : Schematic representation of dimerisations of C. elefians unc-5. using 
constructions in pAS2 and pGAD424 
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Claims : 

1. A protein comprising the sequence of amino 
acids set forth in SEQ ID NO: 2 or a sequence of amino 

5 acids which differs from that set forth in SEQ ID NO: 
2 only in conservative amino acid changes. 

2. A nucleic acid comprising a sequence of 
nucleotides which encodes the protein claimed in claim 

10 1. 

3. A nucleic acid comprising the sequence of 
nucleotides set forth in SEQ ID NO: 1 or a fragment 
thereof . 

15 

4. An expression vector comprising the nucleic 
acid of claim 2 or claim 3. 

5. A host cell or organism transformed or 
20 transfected with the expression vector of claim 4. 

6. An antibody which is capable of specifically 
binding to the protein claimed in claim 1 or an 
epitope thereof. 



25 



30 



7. A protein comprising the sequence of amino 
acids set forth in SEQ ID NO: 4 or a sequence of amino 
acids which differs from that set forth in SEQ ID NO: 

4 only in conservative amino acid changes. 

8. A nucleic acid comprising a sequence of 
nucleotides which encodes the protein claimed in claim 
7. 



35 9. A nucleic acid comprising the sequence of 

nucleotides set forth in SEQ ID NO: 3 or a fragment 
thereof . 
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10. An expression vector comprising the nucleic 
acid of claim 8 or claim 9. 

11. A host cell or organism transformed or 

5 transfected with the expression vector of claim 10. 

12. An antibody which is capable of specifically 
binding to the protein claimed in claim 7 or an 
epitope thereof. 

10 

13. A protein comprising the sequence of amino 
acids set forth in SEQ ID NO: 6 or a sequence of amino 
acids which differs from that set forth in SEQ ID NO: 

6 only in conservative amino acid changes. 

15 

14. A nucleic acid comprising a sequence of 
nucleotides which encodes the protein claimed in claim 
13. 

20 15. A nucleic acid comprising the sequence of 

nucleotides set forth in SEQ ID NO: 5 or a fragment 
thereof. 

16. An expression vector comprising the nucleic 
25 acid of claim 13 or claim 14. 

17. . A host cell or organism transformed or 
transfected with the expression vector of claim 16. 

30 18. An antibody which is capable of specifically 

binding to the protein claimed in claim 13 or an 
epitope thereof. 

19. A method of identifying compounds which are 
35 capable of inhibiting or enhancing the binding of an 
UNC-5 protein to an interacting protein previously 
identified as binding to the said UNC-5 protein, which 
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method comprises: 

providing a host ceil containing a DNA 
construct comprising a reporter gene operatively 
linked to a promoter regulated by a transcription 
factor having a DNA binding domain and an 
activating domain; 

expressing in said host cell a first hybrid 
DNA sequence encoding a first fusion protein 
comprising an UNC-5 protein or a fragment thereof 
fused in-frame to either the DNA binding domain 
or the activating domain of the said 
transcription factor; 

expressing in said host cell a second hybrid 
DNA sequence encoding a second fusion protein 
comprising an interacting protein or a fragment 
thereof fused in-frame to either the DNA binding 
domain or the activating domain of the said 
transcription factor, such that when the first 
fusion protein comprises the activation domain of 
the said transcription factor the second fusion 
protein comprises the DNA binding domain of the 
said transcription factor and when the first 
fusion protein comprises the DNA binding domain 
of the transcription factor the second fusion 
protein comprises the activation domain; 

contacting the host cell with a sample of 
the compound under test; and 

detecting any binding of the UNC-5 protein 
or fragment thereof to the interacting protein or 
fragment thereof by detecting the production of 
any reporter gene product in the said host cell. 

20. A method of identifying compounds which are 
capable of inhibiting or enhancing the binding of an 
UNC-5 protein to an interacting protein previously 
identified as binding to the said UNC-5 protein, which 
method comprises: 
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10 



providing a transgenic cell or organism 
expressing a first fusion protein comprising an 
UNC-5 protein or a fragment thereof fused in- 
frame to a first genetically encoded fluorophore 
and a second fusion protein comprising an 
interacting protein or a fragment thereof fused 
in-frame to a second genetically encoded 
fluorophore, the first and second fluorophores 
being characterised in that the emission spectrum 
of one of the fluorophores overlaps with the 
absorption spectrum of the other fluorophore; 

measuring the amount of fluorescence emitted 
from the fluorophore having an emission spectrum 
which overlaps with the absorption spectrum of 
15 the other fluorophore; 

exposing the transgenic cell or organism to 
a compound under test; and 

detecting any change in the amount of 
fluorescence emitted fluorescence emitted from 
20 the fluorophore having an emission spectrum which 

overlaps with the absorption spectrum of the 
other fluorophore . 

21. A method of identifying compounds which are 
25 capable of inhibiting or enhancing the binding of an 
UNC-5 protein to an interacting protein previously 
identified as binding to the said UNC-5 protein, which 
method comprises: 

providing a first reaction component 
30 comprising a first protein linked to a solid 

support containing a scintillant and a second 
reaction component comprising a second protein 
which has been radioactively labelled, wherein 
the first and second proteins are an UNC-5 
35 protein or a fragment thereof and an interacting 

protein or a fragment thereof; 

bringing the first and second reaction 
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components into contact in an aqueous solution in 
the presence of a compound under test; and 

detecting binding of the first protein to 
the second protein by detecting light emission 
5 from the scintillant. 

22. A method of identifying compounds which are 
capable of inhibiting or enhancing the binding of an 
UNC-5 protein to an interacting protein previously 

10 identified as binding to the said UNC-5 protein, which 
method comprises: 

coating the wells of a microtiter plate with 
UNC-5 protein or a fragment thereof; 

contacting the UNC-5 protein or fragment 
15 thereof with an aqueous solution comprising an 

interacting protein or a fragment thereof, said 
interacting protein being labelled with a tag 
which is directly or indirectly detectable, and a 
compound under test; 
20 washing to remove the compound under test 

and any unbound tagged interacting protein; and 
detecting complexes of UNC-5 or a fragment 
thereof bound to the interacting protein or a 
fragment thereof by directly or indirectly 
25 detecting the presence of the tag. 

23. A method of identifying compounds which are 
capable of inhibiting or enhancing the binding of an 
UNC-5 protein to an interacting protein previously 

30 identified as binding to the said UNC-5 protein, which 
method comprises: 

exposing a cell or organism expressing UNC-5 
and overexpressing nucleic acid encoding an 
interacting protein to the compound under test; 
35 and 

screening for reversion of the 
overexpression phenotype of the cell or organism 



BNSOOCIO: <WO__0073328A2J_> 



WO 00/73328 



- 73 - 



PCT/EP00/05108 



to wild-type. 



24. A method as claimed in claim 23 wherein the 
organism is a nematode worm. 

5 

25. A method as claimed in claim 24 wherein the 
nematode worm is C. elegans. 

26. A method as claimed in claim 23 wherein the 
10 cell is a mammalian cell line. 



27. A method as claimed in any one of claims 23 
to 26 wherein the cell or organism further expresses a 
reporter gene encoding a reporter protein. 

15 

28. A method as claimed in claim 27 wherein the 
reporter protein is a fluorescent protein or a 
luminescent protein. 

20 29. A method as claimed in any one of claims 19 

to 28 wherein the UNC-5 protein is a C. elegans UNC-5 
protein. 

30. A method as claimed in any one of claims 19 
25 to 28 wherein the UNC-5 protein is a human UNC-5 

protein. 

31. A method as claimed in claim 30 wherein the 
human UNC-5 protein is UNC-5C or a protein as claimed 

30 in any one of claims 1, 7, 13 or 71. 

32. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is a C. elegans 
UNC-5 protein or a fragment thereof. 

35 

33. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is C. elegans 
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UNC-40. 

34. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is human UNC-40. 

5 

35. A method as claimed in claim 34 wherein the 
UNC-4 0 protein comprises the sequence of amino acids 
set forth in SEQ ID NO: 95. 

10 36. A method as claimed in any one of claims 29 

to 31 wherein the interacting protein is a C. elegans 
spectrin p-chain/f odrin protein. 

37. A method as claimed in claim 36 wherein the 
15 spectrin p-chain/f odrin protein comprises the sequence 

of amino acids set forth in SEQ ID NO: 12. 

38. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is C. elegans 

20 APR-1. 

39. A method as claimed in claim 38 wherein the 
C. elegans APR-1 protein comprises the sequence of 
amino acids set forth in SEQ ID NO: 16. 

25 

40. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is C. elegans 
UNC-14 . 

30 41. A method as claimed in claim 40 wherein the 

C. elegans UNC-14 protein comprises the sequence of 
amino acids set forth in SEQ ID NO: 20. 

42. A method as claimed in any one of claims 29 
35 to 31 wherein the interacting protein comprises the 
seauence of amino acids set forth in SEQ ID NO: 24. 
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43. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of amino acids set forth in SEQ ID NO: 28. 

44. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of nucleotides set forth in SEQ ID NO: 32. 

45. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of amino acids set forth in SEQ ID NO: 36. 

46. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of amino acids set forth in SEQ ID NO: 40. 

47. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of amino acids set forth in SEQ ID NO: 44. 

48. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises the 
sequence of amino acids set forth in SEQ ID NO: 46. 

49. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is a human UNC-5 
protein. 

50. A method as claimed in claim 4 9 wherein the 
human UNC-5 protein is UNC-5C or a protein as claimed 
in any one of claims 1, 7, 13 or 71. 

51. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein is human i-beta- 
1, 3-N-acetylaminyltransf erase. 

52. A method as claimed in claim 51 wherein the 
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human i-beta-1, 3-N-acetylaminyltransf erase comprises 
the sequence of amino acids set forth in SEQ ID NO: 
50. 

5 53. A method as claimed in any one of claims 29 

to 31 wherein the interacting protein comprises a 
protein encoded by the nucleic acid of claim 72 or 
claim 73. 

10 54. A method as claimed in any one of claims 29 

to 31 wherein the interacting protein comprises a 
protein encoded by the nucleic acid of claim 74 or 
claim 75. 

15 55. A method as claimed in any one of claims 29 

to 31 wherein the interacting protein is human alpha-2 
macroglobulin . 

56. A method as claimed in claim 55 wherein the 
20 alpha-2 macroglobulin comprises the sequence of amino 

acids set forth in SEQ ID NO: 59. 

57. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises a 

25 protein encoded by the nucleic acid of claim 76 or 
claim 77. 

58. A method as claimed in any one of claims 29 
to 31 wherein the interacting protein comprises a 

30 protein encoded by the nucleic acid of claim 78 or 
claim 79. 

59. A method of identifying compounds which 
reduce or inhibit the lethal phenotype associated with 

35 the expression of the UNC-5 death domain in yeast, 
which method comprises: 

exposing a yeast cell containing an 
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expression vector comprising nucleic acid 
encoding an UNC-5 protein or a fragment thereof 
comprising the death domain to a compound under 
test; 

allowing the yeast cells to grow in the 
presence of the compound; and 

screening for a reduction or inhibition of 
the lethal phenotype associated with the 
expression of the UNC-5 death domain in yeast. 

60. A method as claimed in claim 59 wherein the 
UNC-5 protein is a C. elegans UNC-5 protein. 

61. A method as claimed in claim 59 wherein the 
15 UNC-5 protein is a human UNC-5 protein. 

62. A method as claimed in claim 61 wherein the 
human UNC-5 protein is a protein as claimed in any one 
of claims 1, 7 or 13 or 71. 

20 

63. A method of identifying suppressers of the 
lethal phenotype associated with the expression of the 
UNC-5 death domain in yeast, which method comprises: 

transfecting yeast cells containing an 
25 expression vector comprising nucleic acid 

encoding an UNC-5 protein or a fragment thereof 
comprising the death domain with a cDNA library 
cloned in a yeast expression vector; 

allowing the transfected yeast cells to grow 
30 for one or more cell divisions; and 

screening for reduction or inhibition of the 
lethal phenotype associated with the expression 
of the UNC-5 death domain in yeast. 

35 64 . A method as claimed in claim 63, which 

method further comprises the steps of: 

identifying a transfected yeast cell 
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exhibiting a reduction or inhibition of the 
lethal phenotype associated with the expression 
of the UNC-5 death domain in yeast; and 

isolating the cDNA clone (s) present in the 
5 transfected yeast cell which is/are responsible 

for conferring reduction or inhibition of the 
lethal phenotype. 

65. A method as claimed in claim 63 or claim 64 
10 wherein the UNC-5 protein is a C. elegans UNC-5 

protein . 

66. A method as claimed in claim 63 or claim 64 
wherein the UNC-5 protein is a human UNC-5 protein. 

15 

67. A method as claimed in claim 66 wherein the 
human UNC-5 protein is a protein as claimed in any one 
of claims 1, 7, 13 or 71. 

20 68 . A method as claimed in claim 65 wherein the 

cDNA library is a C. elegans cDNA library. 

69. A method as claimed in claim 66 or claim 67 
wherein the cDNA library is a human cDNA library. 

25 

70. A nucleic acid comprising the sequence of 
nucleotides set forth in SEQ ID NO: 7. 

71. A protein comprising a sequence of amino 

30 acids encoded by the nucleic acid molecule of claim 8. 

72. A nucleic acid which is obtainable by 
restriction enzyme digestion of the plasmid pYMP17 
with the restriction enzymes EcoRI and Xhol. 



35 



73. A nucleic acid as claimed in claim 72 which 
comprises the sequence of nucleotides set forth in SEQ 
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ID NO: 56 and a sequence of nucleotides complementary 
to the sequence of nucleotides set forth in SEQ ID NO: 
57. 



5 7 4. A nucleic acid which is obtainable by 

restriction enzyme digestion of the plasmid pYMP6 with 
the restriction enzymes EcoRI and Xhol . 

75. A nucleic acid as claimed in claim 74 which 
10 comprises the sequence of nucleotides set forth in SEQ 
ID NO: 54 and a sequence of nucleotides complementary 
to the sequence of nucleotides set forth in SEQ ID NO: 
55. 

15 76. A nucleic acid comprising the sequence of 

nucleotides set forth in SEQ ID NO: 61 and a sequence 
of nucleotides complementary to the sequence of 
nucleotides set forth in SEQ ID NO: 62. 

20 77. A nucleic acid as claimed in claim 76 which 

is obtainable by restriction enzyme digestion of the 
plasmid pYMPll with the restriction enzymes EcoRI and 
Xhol. 



78. A nucleic acid comprising the sequence of 
nucleotides set forth in SEQ ID NO: 63 and a sequence 
of nucleotides complementary to the sequence of 
nucleotides set forth in SEQ ID NO: 64. 

79. A nucleic acid as claimed in claim 78 which 
is obtainable by restriction enzyme digestion of the 
plasmid pYMP12 with the restriction enzymes EcoRI and 
Xhol. 



80. A nucleic acid probe which is capable of 
hybridizing to the nucleic acid of claim 70 under 
conditions of high stringency. 
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81. An oligonucleotide comprising a sequence of 
10 or more consecutive nucleotides of the sequence of 
nucleotides set forth in SEQ ID NO: 7. 

5 82. An antisense nucleic acid which is capable 

of hybridizing to the sequence of nucleotides set 
forth in SEQ ID NO: 7 under conditions of high 
stringency . 

10 83. An expression vector comprising the nucleic 

acid of claim 70. 

84. A host cell or organism transformed or 
transfected with the expression vector of claim 83. 

15 

85. An antibody which is capable of specifically 
binding to the protein claimed in claim 71. 
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Symbol comparison table: blosum62 
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MSF: 


1599 Check 


0 














Name: 


UNC5C 


Len : 


1599 


Check : 


410 


Weight : 


0 


.76 


Name : 


UNC5C8 


Len: 


1599 


Check: 


1710 


Weight : 
Weight : 
Weight : 
Weight : 


0 


.76 


Name : 


UNC5CC 


Len: 


1599 


Check : 


5512 


1 


.12 


Name: 


UNCSCd (UNCSCb) 


Len: 


1599 


Check: 


1388 


1 


.37 


Name : 


Consensus 


Len: 


1599 


Check : 


7845 


4 


.00 



1 

UNC5C TTGTTTGTGT 
UNC5C8 TTGTTTGTGT 
UNCSCc TTGTTTGTGT 
UNCSCd TTGTTTGTGT 
Consensus TTGTTTGTGT 

51 

UNC5C CTCTTCGGCA 
UNC5C8 CTCTTCGGCA 
UNC5CC CTCTTCGGCA 
UNC5Cd CTCTTCGGCA 
Consensus CTCTTCGGCA 



ATCGGAAGAA TCATCGTGAC 
ATCGGAAGAA TCATCGTGAC 
ATCGGAAGAA TCATCGTGAC 
ATCGGAAGAA TCATCGTGAC 
ATCGGAAGAA TCATCGTGAC 



CTCAATGGGG GCTTTCAGCC 
CTCAATGGGG GCTTTCAGCC 
CTCAATGGGG GCTTTCAGCC 
CTCAATGGGG GCTTTCAGCC 
CTCAATGGGG GCTTTCAGCC 



50 

TTTGAGTCAG ATATTATTGA 
TTTGAGTCAG ATATTATTGA 
TTTGAGTCAG ATATTATTGA 
TTTGAGTCAG ATATTATTGA 
TTTGAGTCAG ATATTATTGA 

100 

TGTGAACATC AAGG CAGCAA 
TGTGAACATC AAGGCAGCAA 
TGTGAACATC AAGGCAGCAA 
TGTGAACATC AAGGCAGCAA 
TGTGAACATC AAGGCAGCAA 



101 150 

UNC5C GACAAGATCT GCTGGCTGTA CCCCCAGACC TCACGTCAGC TGCAGCCATG 

UNC5C8 GACAAGA CC TCACGTCAGC TGCAGCCATG 

UNC5CC GACAAGATCT GCTGGCTGTA CCCCCAGACC TCACGTCAGC TGCAGCCATG 

UNC5Cd GACAAGATCT GCTGGCTGTA CCCCCAGACC TCACGTCAGC TGCAGCCATG 

Consensus GACAAGAtct gctggctgta cccccagaCC TCACGTCAGC TGCAGCCATG 



151 

UNC5C TACAGAGGAC 
UNC5C8 TACAGAGGAC 
UNC5CC TACAGAGGAC 
UNC5Cd TACAGAGGAC 
Consensus TACAGAGGAC 



CTGTCTATGC CCTGCATGAC 
CTGTCTATGC CCTGCATGAC 
CTGTCTATGC CCTGCATGAC 
CTGTCTATGC CCTGCATGAC 
CTGTCTATGC CCTGCATGAC 



200 

GTCTCAGACA AAATCCCAAT 
GTCTCAGACA AAATCCCAAT 
GTCTCAGACA AAATCCCAAT 
GTCTCAGACA AAATCCCAAT 
GTCTCAGACA AAATCCCAAT 



201 

UNC5C GACCAACTCT 
UNC5C8 GACCAACTCT 
UNCSCC GACCAACTCT 
UNCSCd GACCAACTCT 
Consensus GACCAACTCT 



CCAATTCTGG ATCCACTGCC 
CCAATTCTGG ATCCACTGCC 
CCAATTCTGG ATCCACTGCC 
CCAATTCTGG ATCCACTGCC 
CCAATTCTGG ATCCACTGCC 



250 

CAACCTGAAA ATCAAAGTGT 
CAACCTGAAA ATCAAAGTGT 
CAACCTGAAA ATCAAAGTGT 
CAACCTGAAA ATCAAAGTGT 
CAACCTGAAA ATCAAAGTGT 



UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



251 300 
ACAACACCTC AGGTGCTGTC TCCCCCCAAG ATGACCTCTC TGAGTTTACG 
ACAACACCTC AGGTGCTGTC ACCCCCCAAG ATGACCTCTC TGAGTTTACG 

ACAACACCTC AGGTGCTGTC ACC 

ACAACACCTC AAGTGCTGTC ACCCCCCAAG ATGACCTCTC TGAGTTTACG 
ACAACACCTC AgGTGCTGTC aCCccccaag atgacctctc tgagtttacg 

301 350 
TCCAAGCTGT CCCCTCAGAT GACCCAGTCG TTGTTGGAGA ATGAAGCCCT 
TCCAAG CTGT CCCCTCAGAT GACCCAGTCG TTGTTGGAGA ATGAAGCCCT 

TCCAAGCTGT CCCCTCAGAT GACCCAGTCG TTGTTGGAGA ATGAAGCCCT 
tccaagctgt cccctcagat gacccagtcg ttgttggaga atgaagccct 



BNSDOC1D: <WO 0073328A2_I_> 



SUBSTITUTE SHEET (RULE 26) 



WO 00/73328 



2/18 



PCT/EP00/05108 



351 



400 



UNC5C 
UNC5C8 
UNC5CC 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNCSCc 
UNC5Cd 
Consensus 



UNC5C 
UNC5C8 
UNCSCc 
UNC5Cd 
Consensus 



UNC5C 
UNC5C8 
UNCSCC 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNCSCC 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNC5CC 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNC5CC 
UNC5Cd 
Consensus 



UNC5C 
UNC5C8 
UNCSCC 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNCSCc 
UNC5Cd 
Consensus 



CAGCCTGAAG AACCAGAGTC TAG CAAGG C A GACTGATCCA TCCTGTACCG 
CAGCCTGAAG AACCAGAGTC TAGCAAGGCA GACTGATCCA TCCTGTACCG 

CAGCCTGAAG AACCAGAGTC TAGCAAGGCA GACTGATCCA TCCTGTACCG 
cagcctgaag aaccagagtc tagcaaggca gactgatcca tcctgtaccg 

401 450 
CATTTGG CAG CTTCAACTCG CTGGGAGGTC ACCTTATTGT TCCCAATTCA 
CATTTGGCAG CTTCAACTCG CTGGGAGGTC ACCTTATTGT TCCCAATTCA 

TATTGT TCCCAATTCA 

CATTTGGCAG CTTCAACTCG CTGGGAGGTC ACCTTATTGT TCCCAATTCA 
catttggcag cttcaactcg ctgggaggtc a c c t TATTGT TCCCAATTCA 



451 

GGAGTCAGCT 
GGAGTCAGCT 
GGAGTCAGCT 
GGAGTCAGCT 
GGAGTCAGCT 

501 

CGAAATGTAT 
CGAAATGTAT 
CGAAATGTAT 
CGAAATGTAT 
CGAAATGTAT 

551 

ATGACTCTCA 
ATGACTCTCA 
ATGACTCTCA 
ATGACTCTCA 
ATGACTCTCA 

601 

GCTCTGCTCA 
GCTCTGCTCA 
GCTCTGCTCA 
GCTCTGCTCA 
GCTCTGCTCA 

651 



TGCTGATTCC 
TGCTGATTCC 
TGCTGATTCC 
TGCTGATTCC 
TGCTGATTCC 



GTGACTGTAC 
GTGACTGTAC 
GTGACTGTAC 
GTGACTGTAC 
GTGACTGTAC 



GACACTTTTG 
GACACTTTTG 
GACACTTTTG 
GACACTTTTG 
GACACTTTTG 



CGCTGGGGCC 
CGCTGGGGCC 
CGCTGGGGCC 
CGCTGGGGCC 
CGCTGGGGCC 



ACAGGAAAGA 
ACAGGAAAGA 
ACAGGAAAGA 
ACAGGAAAGA 
ACAGGAAAGA 



ACCCCTGTGG 
ACCCCTGTGG 
ACCCCTGTGG 
ACCCCTGTGG 
ACCCCTGTGG 



ATTCCCCAAG 
ATTCCCCAAG 
ATTCCCCAAG 
ATTCCCCAAG 
ATTCCCCAAG 



AACTATGAGG 
AACTATGAGG 
AACTATGAGG 
AACTATGAGG 
AACTATGAGG 



TGAGCTGTGG 
TGAGCTGTGG 
TGAGCTGTGG 
TGAGCTGTGG 
TGAGCTGTGG 



CCCGCCCCGT 
CCCGCCCCGC 
CCCGCCCCGT 
CCCGCCCCGT 
CCCGCCCCGt 



CGTCCTCACT 
CGTCCTCACT 
CGTCCTCACT 
CGTCCTCACT 
CGTCCTCACT 



ATG CATCACT 
ATGCATCACT 
ATG CATCACT 
ATGCATCACT 
ATGCATCACT 



500 

GGAGAGTCTA 
GGAGAGTCTA 
GGAGAGTCTA 
GGAGAGTCTA 
GGAGAGTCTA 

550 

CCACCCATGG 
CCACCCATGG 
CCACCCATGG 
CCACCCATGG 
CCACCCATGG 

600 

GCCCCCAGGA 
GCCCCCAGGA 
GCCCCCAGGA 
GCCCCCAGGA 
GCCCCCAGGA 

650 

GCGCAGACCC 
GCGCAGACCC 
GCGCAGACCC 
GCGCAGACCC 
GCGCAGACCC 

700 



CAATACCGAG GACTGGAAAA TACTGCTCAA GAACCAGGCA GCACAGGGAC 

CAATACCGAG GACTGGAAAA TACTGCTCAA GAACCAGGCA GCACAGGGAC 

CAATACCGAG GACTGGAAAA TACTGCTCAA GAACCAGGCA GCACAGGGAC 

CAATACCGAG GACTGGAAAA TACTGCTC - - 

CAATACCGAG GACTGGAAAA TACTG CTC a a gaaccaggca gcacagggac 



701 



750 



AGTGGGAGGA TGTGGTGGTG GTCGGGGAGG AAAACTTCAC CACCCCCTGC 
AGTGGGAGGA TGTGGTGGTG GCCGGGGAGG AAAACTTCAC CACCCCCTGC 
AGTGGGAGGA TGTGGTGGTG GTCGGGGAGG AAAACTTCAC CACCCCCTGC 

agtgggagga tgtggtggtg g cggggagg aaaacttcac caccccctgc 



751 



800 



TACATTAAGC TGGATGCAGA GGCCTGCCAC ATCCTCACAG AGAACCTCAG 

TACATTAAGC TGGATGCAGA GGCCTGCCAC ATCCTCACAG AGAACCTCAG 

T ACATTC AG C TGGATGCAGA GGCCTGCCAC ATCCTCACAG AGAACCTCAG 

tacatt age tggatgeaga ggcctgccac atcctcacag agaacctcag 



801 



850 



CACCTACGCC CTGGTAGGAC ATTCCACCAC CAAAGCGGCT GCAAAGCGCC 
CACCTACGCC CTGGTAGGAC ATTCCACCAC CAAAGCGGCT GCAAAGCGCC 
CACCTACGCC CTGGTAGGAC ATTCCACCAC CAAAGCGGCT GCAAAGCGCC 

cacctacgcc ctggtaggac attccaccac caaagegget gcaaagcgcc 



851 

TCAAGCTGGC CAT CTTTGGG CCCCTGTGCT GCTCCTCGCT 
TCAAG CTGGC CATCTTTGGG CCCCTGTGCT GCTCCTCGCT 
TCAAGCTGGC CATCTTTGGG CCCCTGTGCT GCTCCTCGCT 

CTCGCT 

tcaagctggc catctttggg cccctgtgct gctcCTCGCT 

SUBSTTTUTE SHEET (RULE 26) 



900 

GGAGTACAGC 
GGAGTACAGC 
GGAGTACAGC 
GGAGTACAGC 
GGAGTACAGC 
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901 

UNC5C ATCCGAGTCT ACTGTCTGGA TGACACCCAG 
UNC5C8 ATCCGAGTCT ACTGTCTGGA TGACACCCAG 
UNC5CC ATCCGAGTCT ACTGTCTGGA TGACACCCAG 
UNC5Cd ATCCGAGTCT ACTGTCTGGA TGACACCCAG 
Consensus ATCCGAGTCT ACrnTrTraa TCAC B JCCCAG 



GATGCCCTGA 
GATGCCCTGA 
GGTGCCCTGA 
GATGCCCTGA 



950 

AGGAAATTTT 
AGGAAATTTT 
AGGAAATTTT 
AGGAAATTTT 



951 1000 
UNC5C ACATCTTGAG AGACAGACGG GAGGACAGCT CCTAGAAGAA CCTAAGGCTC 
UNC5C8 ACATCTTGAG AGAXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXAGGTTT 
UNC5CC ACATCTTGAG AGACAGACGG GAGGACAGCT CCTAGAAGAA CCTAAGGCTC 
UNCSCd ACATCTTGAG AGACAGACGG GAGGACAGCT CCTAGAAGAA CCTAAGGCTC 
Consensus ACATCTTGAG AGAcagacgg gaggacagct cctagaagaa cctaAGGcTc 



1001 

UNC5C TTCATTTTAA 
UNC5C8 TTCATTT-AA 
UNC5CC TTCATTTTAA 
UNC5Cd TTCATTTTAA 
Consensus TTCATTTtAA 



AGGCAGCACC CACAACCTGC 
AG C ANG CAN C CNNCAAATGN 
AGGCAGCACC CACAACCTGC 
AGGCAGCACC CACAACCTGC 
AGgcaGCAcC CacaAccTGc 



1050 

GCCTGTCAAT TCACGATATC 
GCCTGTCAAT TCNCGATATG 
GCCTGTCAAT TCACGATATC 
GCCTGTCAAT TCACGATATC 
GCCTGTCAAT TCaCGATATc 



1051 

UNC5C GCCCATTCCC 
UNC5C8 GCCCATTCCC 
UNCSCc GCCCGTTCCC 
UNC5Cd GCCCATTCCC 
Consensus GCCCaTTCCC 



TCTGGAAGAG CAAATTGCTG 
TCTGAAAGAG CAAATTGCTG 
TCTGGAAGAG CAAATTGCTG 
TCTGGAAGAG CAAATTGCTG 
TCTGgAAGAG CAAATTGCTG 



1100 

GCTAAATATC AGGAAATTCC 
GCTAAATATC AGGAAATTCC 
GCTAAATATC AGGAAATTCC 
GCTAAATATC AGGAAATTCC 
GCTAAATATC AGGAAATTCC 



1101 

UNC5C ATTTTACCAT 
UNC5C8 ATTTTACCAT 
UNC5CC ATTTTACCAT 
UNCSCd ATTTTACCAT 
Consensus ATTTTACCAT 



GTTTGGAGTG GATCTCAAAG 
GTTTGGAGTG GATCTCAAAG 
GTTTGGAGTG GATCTCAAAG 
GTTTGGAGTG GATCTCAAAG 
GTTTGGAGTG GATCTCAAAG 



1150 

AAACCTGCAC TGCACCTTCA 
AANCCTGCAC TGCACNTTCA 
AAACCTGCAC TGCACCTTCA 
AAACCTGCAC TGCACCTTCA 
AAaCCTGCAC TGCACcTTCA 



1151 

UNC5C CTCTGGAAAG 
UNC5C8 CTCTGGAAAG 
UNC5CC CTCTGGAAAG 
UNCSCd CTCTGGAAAG 
Consensus CTCTGGAAAG 

1201 

UNC5C GTGCGGCAGG 
UNC5C8 GT-CGGCAGG 
UNCSCc GTGCGGCAGG 
UNCSCd GTGCGGCAGG 
Consensus GTgCGGCAGG 

1251 

UNC5C GTCAGAGGAA 
UNC5C8 GTCAGAGGAA 
UNC5CC GTCAGAGGAA 
UNCSCd GTCAGAGGAA 
Consensus GTCAGAGGAA 



ATTTAG CCTG AACACAGTGG 
ATTT AG CCTG AACACAGTGG 
ATTTAGCCTG AACACAGTGG 
ATTTAGCCTG AACACAGTGG 
ATTTAGCCTG AACACAGTGG 



TGGAAGGAGA AGGGCAGATC 
TGGAAGGAGA AGG-CAGATC 
TGGAAGGAGA AGGGCAGATC 
TGGAAGGAGA AGGGCAGATC 
TGGAAGGAGA AGGg CAGATC 



CCTACTGGCA TCGATTTGCC 
CCTACTGGCA TCGATTTGCC 
CCTACTGGCA TCGATTTGCC 
CCTACTGGCA TCGATTTGCC 
CCTACTGGCA TCGATTTGCC 



1200 

AGCTGGTTTG CAAACTCTGT 
AGCTGGTTTG CAAACTCTGT 
AGCTGGTTTG CAAACTCTGT 
AGCTGGTTTG CAAACTCTGT 
AGCTGGTTTG CAAACTCTGT 

1250 

TTCCAGCTCA ACTGCACCGT 
TTCCAGCTCA ACTGCACCGT 
TTCCAGCTCA ACTGCACCGT 
TTCCAGCTCA ACTGCACCGT 
TTCCAGCTCA ACTGCACCGT 

1300 

GCTGCTGGAT CCTGCGAACA 
GCTGCTGGAT CCTGCGAACA 
GCTGCTGGAT CCTGCGAACA 
GCTGCTGGAT CCTGCGAACA 
GCTGCTGGAT CCTGCGAACA 



1301 

UNC5C CCATCACCAC 
UNC5C8 CCATCACCAC 
UNC5CC CCATCACCAC 
UNCSCd CCATCACCAC 
Consensus CCATCACCAC 



GGTCACGGGG CCCAGTGCTT 

GGTCACGGGG CCCAGTGCTT 

GGTCACGGGG CCCAGTGCTT 

GGTCACGGGG CCCAGTGCTT 

GGTCACGGGG CCCAGTGCTT 



1350 

TCAGCATCCC TCTCCCTATC 
TCAGCATCCC TCTCCCTATC 
TCAGCATCCC TCTCCCTATC 
TCAGCATCCC TCTCCCTATC 
TCAGCATCCC TCTCCCTATC 



UNC5C 
UNC5C8 
UNCSCc 
UNCSCd 
Consensus 



UNC5C 
UNC5C8 
UNC5CC 
UNC5Cd 
Consensus 



1351 

CGGCAGAAGC 
CGG CAGAAGC 
CGGCAGAAGC 
CGGCAGAAGC 
CGGCAGAAGC 



TCTGTAGCAG 
TCTG TAG CAG 
TCTGTAGCAG 
TCTGTAGCAG 
TCTGTAGCAG 



CCTGGATGCC 
CCTGGATGCC 
CCTGGATGCC 
CCTGGATGCC 
CCTGGATGCC 



1401 

CTGGAGGATG CTGGCCCATA AG CTGAACCT 

CTGGAGGATG CTGGCCCATA AG CTGAACCT 

CTGGAGGATG CTGGCCCATA AG CTGAACCT 

CTGGAGGATG CTGGCCCATA AG CTGAACCT 

CTGGAGGATG CTGGCCCATA AG CTGAACCT 

SUBSTITUTE SHEET (RULE 26) 



CCCCAGACGA 
CCCCAGACGA 
CCCCAGACGA 
CCCCAGACGA 
CCCCAGACGA 



GGACAGGTAC 
GGACGGGTAC 
GGACAGGTAC 
GGACAGGTAC 
GGACaGGTAC 



1400 
GAGG CCATG A 
GAGG CCATGA 
GAGG CCATG A 
GAGG CCATGA 
GAGG CCATGA 

1450 
TTGAATTACT 
TTGAATTACT 
TTGAATTACT 
TTGAATTACT 
TTGAATTACT 
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UNC5C 
UNC5C8 
UNC5CC 
UNC5Cd 
Consensus 



1451 

TTGCCACCAA 
TTGCCACCAA 
TTGCCACCAA 
TTGCCACCAA 
TTGCCACCAA 



ATCCAGCCCA 
ATCCAGCCCA 
ATCCAGCCCA 
ATCCAGCCCA 
ATCCAGCCCA 



ACTGGCGTAA 
ACTGGCGTAA 
ACTGGCGTAA 
ACTGGCGTAA 
ACTGGCGTAA 



TCCTGGATCT 
TCCTGGATCT 
TCCTGGATCT 
TCCTGGATCT 
TCCTGGATCT 



1500 
TTGGGAAGCA 
TTGGGAAGCA 
TTGGGAAGCA 
TTGGGAAGCA 
TTGGGAAGCA 



UNC5C 
UNC5C8 
UNC5CC 
UNC5Cd 
Consensus 



1501 

CAGAACTTCC 
CAGAACTTCC 
CAGAACTTCC 
CAGAACTTCC 
CAGAACTTCC 



CAGATGGAAA 
CAGATGGAAA 
CAGATGGAAA 
CAGATGGAAA 
CAGATGGAAA 



CCTGAGCATG 
CCTGAGCATG 
CCTGAGCATG 
CCTGAGCATG 
CCTGAGCATG 



CTGG CAGCTG 
CTGGCAGCTG 
CTGGCAGCTG 
CTGGCAGCTG 
CTGGCAGCTG 



1550 
T CTTGGAAG A 
TCTTGGAAGA 
T CTTGGAAG A 
TCTTGGAAGA 
TCTTGGAAGA 



UNC5C 
UNC5C8 
UNC5CC 
UNC5Cd 
Consensus 



1551 

AATGGGAAGA 
AATGGGAAGA 
AATGGGAAGA 
AATGGGAAGA 
AATGGGAAGA 



CATGAAACGG 
CATGAAACGG 
CATGAAACGG 
CATGAAACGG 
CATGAAACGG 



TGGTGT CCTT 
TGGTGTCCTT 
TGGTGTCCTT 
TGGTGTCCTT 
TGGTGTCCTT 



AGCAGCAGAA 
AGCAGCAGAA 
AGCAGCAGAA 
AGCAGCAGAA 
AGCAGCAGAA 



1599 
GGGCAGTAT 
GGGCAGTAT 
GGGCAGTAT 
GGGCAGTAT 
GGGCAGTAT 
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Multalin version 5.3.3 

Copyright I.N.R.A. France 1989, 1991, 1994, 1996 

Publichcd research usiiiy Lfti t> software should cite 

Multiple sequence alignment with hierarchical clustering 

F, CORPET, 1988, Nucl . Acids Res., 16 22, 10881-10890 

Symbol comparison table : blosum62 

Gap weight : 12 

Gap length weight: 2 

Consensus levels: high=90% low=50% 

Consensus symbols: 

! is anyone of IV 

$ is anyone of LM 

% is anyone of FY 

# is anyone of NDQEBZ 



MSF: 


2908 Check 


: 0 














Name : 


ratunc5hl 


Len : 


2908 


Check: 


8912 


Weight 


0. 


87 


Name : 


ym97d!2 


Len : 


2908 


Check : 


4745 


Weight - 


0 . 


87 


Name : 


1G 


Len : 


2908 


Check : 


1058 


Weight 


; 1. 


05 


Name : 


Urc 


Len : 


2908 


Check : 


508 


Weight • 


1. 


04 


Name : 


2Brc 


Len: 


2908 


Check: 


6768 


Weight : 


1. 


04 


Name : 


3D 


Len: 


2908 


Check : 


8193 


Weight - 


1. 


13 


Name : 


Consensus 


Len: 


2908 


Check : 


6031 


Weight . 


6. 


00 



// 



1 50 
ratunc5hl ATGGCCGTCC GGCCCGGCCT GTGGCCAGTG CTCCTGGGCA TAGTCCTCGC 
ym97d!2 
1G 
Urc 
2Brc 
3D 

Consensus 



51 100 
ratuncShl CGCCTGGCTT CGTGGTTCGG GTGCCCAGCA GAGTGCCACG GTGGCCAATC 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



101 150 
ratunc5hl CAGTGCCCGG TGCCAACCCC GACCTGCTGC CCCACTTCCT GGTAG AG CCT 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



151 200 
ratuncShl GAGGACGTGT ACATTGTCAA GAACAAGCCG GTGTTGTTGG TGTGCAAGGC 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 
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ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



201 

TGTGCCTGCC ACCCAGATCT TCTTCAAGTG CAATGGGGAA 



250 

TGGGTCCGCC 



ratunc5hl 
ym97d!2 
1G 
Urc 
2Brc 
3D 

Consensus 



251 

AGGTCGATCA 



CGTAATTGAA CGCAGCACCG AC AG CAG C AG 



300 

CGGATTGCCA 



ratunc5hl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



301 350 
ACCATGGAGG TCCGTATCAA CGTATCGAGG CAGCAGGTAG AGAAAGTGTT 



ratunc5hl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



351 400 
TGGGCTGGAG GAATACTGGT GCCAGTGTGT GGCATGGAGC TCCTCGGGTA 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



401 450 
CCACCAAAAG TCAGAAGGCC TACATCCGGA TTGCCTATTT GCGCAAGAAC 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



451 500 
TTTGAGCAGG AGCCACTGGC CAAGGAAGTG TCACTGGAGC AAGGCATTGT 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



501 550 
ACTACCTTGT CGCCCCCCAG AAGGAATCCC CCCAGCTGAG GTGGAGTGGC 
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ratuncShl 
ym97d!2 



551 600 
TTCGAAATGA GGACCTCGTG GACCCCTCCC TCGATCCCAA TGTGTACATC 



Urc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



601 650 
ACGCGGGAGC ACAGCCTAGT CGTGCGTCAG GCCCGCCTGG CCGACACGGC 



ratuncShl 
ytn97d!2 
1G 
Urc 
2Brc 
3D 

Consensus 



651 700 
CAACTACACC TGTGTGGCCA AGAACATCGT AGCCCGTCGC CGAAG CACCT 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



701 750 
CTGCAGCGGT CATTGTTTAT GTGAACGGTG GGTGGTCGAC GTGGACTGAG 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



751 800 
TGGTCCGTCT GCAGCGCCAG CTGTGGGCGT GGCTGGCAGA AACGGAGCCG 



ratunc5hl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



801 

GAGCTGCACC AACCCGGCAC 



CTCTCAACGG GGGCGCCTTC 



850 

TGTGAGGGGC 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



851 

AGAATGTCCA GAAAACAGCC 



TGCGCCACTC 



900 

TGTGCCCAGT GGATGGGAGC 
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ratuncShl 
ym97dl2 
1G 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
IJrc 
2Brc 
3D 

Consensus 



ratunc5hl 
ym97d!2 
1G 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
IJrc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
IJrc 
2Brc 
3D 

Consensus 



901 950 
TGGAGTTCGT GGAGTAAGTG GTCAGCCTGT GGGCTTGACT GCACCCACTG 



951 1000 
GCGGAGCCGC GAGTGCTCTG ACCCAGCACC CCGCAATGGA GGTGAGGAGT 



1001 1050 
GTCGGGGTGC TGACCTGGAC ACCCGCAACT GTACCAGTGA CCTCTGCCTG 



CAGTGA CCTCTGTGTA 



1051 "00 
CACACCGCTT CTTGCCCCGA GGACGTGGCT CTCTACATCG GCCTTGTCGC 



CACACTGCTT CTGGCCCTGA GGACGTGGCC CTCTATGTGG GCCTNATCGC 



Predicted transmembrane region 
1101 1150 

TGTGGCTGTG TGCCTCTTCT TGCTGTTGCT GGCCCTTGGA CTCATTTACT 



CGTGGCCGNN TGCCTGGTCC TGCTGCTGCT TGTCCTCATC CTCGTTTATT 
tt c g cc ccctta 

1151 1200 
GTCGCAAGAA GGAAGGGCTG GACTCCGATG TGGCCGACTC GTCCATCCTC 



GCCGGAAGAA GGAGGGGCTG GACTCANATG TGGCTGACTC GTCCATTCTC 
gcc aa gg g ga g t c ga c t t tc 

1201 1250 
ACCTCGGGCT TCCAGCCTGT C AG CATC AAG CCCAGCAAAG CAGACAACCC 



ACCTCAGGCT TCCAGCCCGT CAGCATCAAG CCCAGCAAAG CAGACAACCC 
cc a t t g cc t age a ca g c cc 
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ratunc5hl 
ym97dl2 



1251 1300 
CCACCTGCTC ACCATCCAGC CAGACCTCAG CACCACCACT ACCACCTACC 



Urc CCTTGGGTTC - CCNTCAAGT GGTNNCANGG GGGTGGCCCT TGAA--TTCA 

2Brc ACTTGGGTTC - CCNTCAAGT TGT--CAATG GGGNGCCCCT --GA--ATCA 

3D CCATCTGCTC ACCATCCAGC CGGACCTCAG CACCACCACC ACCACCTACC 

Consensus cc t g tc cc tc ag g c g cc c ate 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



1301 

AGGGCAGTCT ATGTTCGAGG 



AGGG CAGTCT NTGTCCCCGG 
ag at tgt gg 



1350 

CAGGATGGAC CCAGCCCCAA GTTCCAGCTC 



CAGGATGGGC CCAGCCCCAA GTTCCAGCTC 
gg tgg c age c ccag 



ratunc5hl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



1351 1400 
TCTAATGGTC ACCTGCTCAG CCCACTGGGG AGTGGCCGCC ATACGTTGCA 

GCC AC AC- -TGCA 
TCAG CCCCCTGGGT GGCGGCCGCC ACACACTGCA 



ACCAATGGGC ACCTGCTCAG CCCCCTGGGT GGCGGCCGCC ACACACTGCA 
aa g c cct tcag ccc cctggg g ggccgCC acac tGCA 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



1401 1450 
CCACAGCTCA CCCACCTCTG AGGCTGAGGA CTTCGTCTCC CGCCTCTCCA 
CCACAGCTCT CCCACCTCTG AGGCCGAGGA GTTCGTCTCC CGCCTCTCCA 
CCACAGCTCT CCCACCTCTG AGGCCGAGGA GTTCGTCTCC CGCCTCTCCA 



CCACAGCTCT CCAACCTNTG AGGCCNAGGA GTTCGNNTCC CGCCTTTCCA 
cCacagCtct cCcacctctG aggee AGGa gttCg tec cGccT Tcca 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



1451 1500 

CCCAAAACTA CTT - TCGTTC CCTGCCCCGC GGCACCAGCA ACATGGCCTA 

CCCAGAACTA CTT-CCGCTC CCTGCCCCGA GGCACCAGCA ACATGACCTA 

CCCAGAACTA CTT-CCGCTC CCTGCCCCGA GGCACCAGCA ACATGACCTA 



CCCAGAACTA CTTNCGGTTC CTTGCCCCCA GGCNCCAGCA ACATGACCTT 
cccagaacTa ctT cgGttC ctTgccCcga GGc ccagca acAtGaCCT 



ratunc5hl 
ym97d!2 
1G 
Urc 
2Brc 
3D 

Consensus 



1501 1550 
C--GGGACCT TCA-ACTTCC TCGGGGG-CC GGCTGATGAT - - CCCTAATA 
T- -GGGACCT TCA-ACTTCC TCGGGGG-CC GGCTGATGAT --CCCTAATA 
T - - GGGACCT TCNNACTTCC TCGGGGG-CC GGCTGATGAT --CCCTAATA 



ATGGGGACCT TTAAATTTCT TCGGGGGNCC GGNTTATGAA NCCCTAATTC 
gGGaCCT t acTTCc TcggggG CC Gg t atga cc atTc 



ratuncShl 
ym97dl2 
1G 



1551 

CGGGGA--TC 
CAGGAA--TC 
CAGGAA- -TC 
Urc CAGGAA- -TC 
2Brc CAGGAA- -TC 
3D CAGGGAATTA 
Consensus CaGGaA Tc 



1600 

AGCCTCCT-C ATACCCCCGG ATGCCATCCC CC-GAGGAAA 
AGCCTCCT-C ATCCCCCCAG ATGCCATACC CC-GAGGGAA 
AGCCTCCT-C ATNCCCCCAG ATGCCATACC CC-GAGGGAA 
AGCCTCCT-C ATCCCCCCAG ATGCCATACC CC-GAGGGAA 
AGCCTCCT-C ATCCCCCCAG ATGCCATACC CC-GAGGGAA 
AACCTTCTTA ATCCCCCCAA ATG CCANACC CCCGANGGAA 
AgCCTcCT c ATcCCCCCag ATGCCA t a CC CC GAgGgAA 
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ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97d!2 
1G 
Urc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



ratunc5hl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



1601 

GATCT-ACGA 
GATCT-ATGA 
GATCT-ATGA 
GATCT-ATGA 
GATCT-ATGA 
NATCTNTTGN 
gATCT atGa 

1651 

CCCCTAGCTG 
CCCCTAGCTG 
CCCCTAGCTG 
CCCCTAGCTG 
CCCCTAGCTG 
AACC 

cccctagctg 
1701 

CCCA - GGAGT 
CCCT-GGCGT 
CCCT-GGCGT 
CCCT-GGCGT 
CCCT-GGCGT 



GATCTACCTC 
GATCTACCTC 
GATCTGCCTC 
GATCTACCTC 
GATCTACCTC 
NAACTACCTT 
gAtCTaCCTc 



GCTGTCAGAC 
GCTGTCAGAC 
GCTGTCAGAC 
GCTGTCAGAC 
GCTGTCAGAC 



ACACTGCACA 
ACGCTGCACA 
ACGCTGCACA 
ACGCTGCACA 
ACGCTGCGCA 

A A 

AcgctgcacA 



CCTGCTGAGT 
CCTGCTGAGT 
CCTGCTGAGT 
CCTGCTGAGT 
CCTGCTGAGT 



AG CCAG AAG A 
AGCCGGAAGA 
AGCCGGAAGA 
AGCCGGAAGA 
AGCCGGAAGA 
ANCTTGANNA 
AgCcgGAagA 



CCAGTCGTTA 
CCCATCGTTA 
CCCATCGTTA 
CCCATCGTTA 
CCCATCGTTA 



1650 
CGTGAGGTTG 
CGTGAGGTTG 
CGTGAGGTTG 
CGTGAGGTTG 
CGTGAGGTTG 
AGCCCGGAAA 
cGtgaGGttg 

1700 
GCTGTGGGCC 
GCTGTGGACC 
GCTGTGGACC 
GCTGTGGACC 
GCTGTGGACC 



gctgtcagac cctgctgagt cccatcgtta gctgtggacc 



CCTGCTCACC 
CCTGCTCACC 
CCTGCTCACC 
CCTGCTCACC 
CCTGCTCACC 



CGGCCAGTCA T-CCTTG 

CGGCCAGTCA T-CCTGG 

CGGCCAGTCA T-CCTGG 

CGGCCAGTCA T-CCTGG 

CGGCCAGTCA T-CCTGG 



1750 

-CA ATGGACCACT 
-CT ATGGACCACT 
-CT ATGGACCACT 
-CT ATGGACCACT 
-CT ATGGACCACT 



ccct ggcgt cctgctcacc cggccagtca t cctgg ct atggaccact 



1751 

GT- -GG AG AG CCCA-GCCCT - GACAGC 

GT--GGGGAG CCCA-GCCCT -GACAGC 
GT- -GGGGAG CCCA-GCCCT 

GT--GGGGAG CCCA-GCCCT -GACAGC 
GT--GGGGAG CCCA-GCCCT 



T GGAGTC - TGC GCCT- 

T GGAGCC-TGC GCCT- 

G ACAGC - - T GGAGCC-TGC GCCT- 

T GGAGCC-TGC GCCT- 

GACAGC--T GGAGCC-TGC GCCT- 



1800 
- -CAA 
--CAA 
-CAA 
-CAA 
-CAA 



gt ggggag ccca gccct gacagc t ggagcc tgc gcct caa 



1801 

AAAGCAG- 
AAAGCAG- 
AAAGCAG- 
AAAGCAG- 
AAAGCAG - 



TC CTGC - GAGGG CAGTTGGG- 

TC GTGC- GAGGG CAGCTGGG - 

TC GTGC -GAGGG CAGCTGGG - 

TC GTGC -GAGGG CAGCTGGG - 

TC GTGC -GAGGG CAGCTGGG - 



-AGGATGTGC 
- AGGATGTGC 
- AGGGTGTGC 
-AGGATGTGC 
-AGGATGTGC 



1850 
-TGCACCT-T 
-TGCACCT-G 
-TGCACCT-G 
-TGCACCT-G 
-TGCACCT-G 



aaagcag tc tgc gaggg cagctggg 



aggatgtgc tgcacct g 



1900 



1851 

GGTGAGGAGT CACCTTCCCA CCTCTACTAC TGCCAGCTGG AGGCCGGGGC 
GGCGAGGAGG CGCCCTCCCA CCTCTACTAC TGCCAGCTGG AGGCCAGTGC 
GGCGAGGAGG CGCCCTCCCA CCTCTACTAA NTAAANCCCN AA-TTNTTGC 
GGCGAGGAGG CGCCCTCCCA CCTCTACTAG 
GGCGAGGAGG CGCCCTCCCA CCTCTACTAA G 

ggcgaggag cgccctccca cctctacta 



1901 

CTGCTATGTC 
CTGCTACGTC 
AAAAATCCNT 



1950 

TTCACGGAGC AGCTGGGCCG CTTTGCCCTG GTAGGAGAGG 
TTCACCGAGC AGCTGGGCCG CTTTGCCCTG GTGGGAGAGG 
TTAAAATTGT NG--GNCCCN TTNAAACCTN 
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ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



1951 2000 
CCCTCAGCGT GGCTGCCACC AAGCGCCTCA GGCTCCTTCT GTTTGCTCCC 
CCCTCAGCGT GGCTGCCGCC AAGCGCCTCA AGCTGCTTCT GTTTGCGCCG 
CCCTTAAAAA GGGGCCCAAT TTCCNCCTNT NNGGNANCCN --TTNAAAAN 



ratunc5hl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2001 2050 
GTGGCCTGTA CGTCCCTTGA GTACAACATC CGAGTGTACT GCCTACACGA 
GTGGCCTGCA CCTCCCTCGA GTACAACATC CGGGTCTACT GCCTGCATGA 
NTAACTGGCC CCTNTTTTNA AAACNNNCGA NCNGGGNAAA NCC 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2051 2100 
CACCCACGAC GCTCTCAAGG AGGTGGTGCA GCTGGAGAAG CAG CTAGGTG 
CACCCACGAT GCACTCAAGG AGGTGGTGCA GCTGGAGAAG CAGCTGGGGG 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2101 2150 
GACAGCTGAT CCAGGAGCCT CGCGTCCTGC ACTTCAAAGA CAGTTACCAC 
GACAGCTGAT CCAGGAGCCA CGGGTCCTGC ACTTCAAGGA CAGTTACCAC 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2151 2200 
AACCTACGTC TCTCCATCCA CGACGTGCCC AGCTCCCTGT GGAAGAGCAA 
AACCTGCGCC TATCCATCCA CGATGTGCCC AGCTCCCTGT GGAAGAGTAA 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2201 2250 
GCTACTTGTC AGCTAC CAGG AGATCCCTTT TTACCACATC TGGAACGGCA 
GCTCCTTGTC AG CTACCAGG AGATCCCCTT TTATCACATC TGGAATGGCA 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2251 2300 
CCCAGCAGTA TCTGCACTGC ACCTTCACCC TGGAGCGCAT CAACGCCAGC 
CGCAGCGGTA CTTGCACTGC ACCTTCACCC TGGAGCGTGT CAGCCCCAGC 
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^? (Pa+'sw/y^z? 7). 



ratunc5hl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2301 2350 
ACCAGCGACC TGGCCTGCAA GGTGTGGGTG TGGCAGGTGG AGGGAGATGG 
ACTAGTGACC TGGCCTGCAA GCTGTGGGTG TGGCAGGTGG AGGGCGACGG 



ratunc5hl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2351 2400 
G C AG AG CTTC AACATCAACT TCAACATCAC TAAGGACACA AGGTTTGCTG 
GCAGAGCTTC AG CATCAACT TCAACATCAC CAAGGACACA AGGTTTGCTG 



ratunc5hl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2401 2450 
AATTGTTGGC TCTGGAGAGT GAAGGGGGGG TCCCAGCCCT GGTGGGCCCC 
AGCTGCTGGC TCTGGAGAGT GAAGCGGGGG TCCAAGCCCT GGTGGGCCCC 



ratuncShl 
ym97d!2 
1G 
Urc 
2Brc 
3D 

Consensus 



2451 2500 
AGTGCCTTCA AGATCCCCTT CCTCATTCGG CAAAAGATCA TCGCCAGTCT 
AGTGCCTTCA AGATCCCCTT CCTCATTCGG CAGAAGATAA TTTCCAGCCT 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2501 2550 
GGACCCACCC TGCAGCCGGG GCGCCGACTG GAGAACTCTA GCCCAGAAAC 
GGACCCACCC TGTAGGCGGG GTGCCGACTG GCGGACTCTG GCCCAGAAAC 



ratunc5hl 
ym97d!2 
1G 
Urc 
2Brc 
3D 

Consensus 



2551 2600 
TTCACCTGGA CAGCCATCTT AGCTTCTTTG CCTCCAAGCC CAGCCCTACA 
TCCACCTGGA CAGCCATCTC AGCTTCTTTG CCTCCAAGCC CAGCCCCACA 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2601 2650 
GCCATGATCC TCAACCTATG GGAGGCACGG CACTTCCCCA ACGGCAACCT 
GCCATGATCC TCAACCTGTG GGAGGCGCGG CACTTCCCCA ACGGCAACCT 
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ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



ratunc5hl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97d!2 
1G 
Urc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



ratuncShl 
ym97dl2 
1G 
Urc 
2Brc 
3D 

Consensus 



2651 2700 
CGGCCAGCTG GCAGCAGCTG TGGCCGGACT GGGCCAACCA GATGCTGGCC 
CAGCCAGCTG GCTGCAGCAG TGGCTGGACT GGGCCAGCCA GACGCTGGCC 



2701 2750 
TCTTCACGGT GTCGGAGGCC GAGTGTTGA 

TCTTCACAGT GTCGGAGGCT GAGTG CTGAG GCCGGCCAGG CCCGACACCT 



2751 2800 
ACACTCTCAC CAGCTTTGGC ACCCACCAAG GACAGGCAGA AGCCGGACAG 



2801 2850 
GGGCCCTTCC CCACACCGGG GAG AG CTGCT CGGACAGGCC CCCTCCCGGC 



2851 2900 
CGAAGCTGTC CCTTAATGCT GGTCCTTCAG ACCCTGCCCC CTCGTGCCGA 



2901 

ATTCTGGC 
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gi!205715 (M96376) neurexin II-alpha-b [Rattus norvegicus] 3J. 7.4 
qi | 205715 (M96376) neurexin II-alpha-b [Rattus norvegicus] 
Length = 1728 

Score =31.3 bits (69), Expect =7.4 

Identities = 16/38 (42%), Positives = 20/38 (52%) 

Query: 337 KACSVCXAGRRALMGKLLEEQGXGVGGRGKANADIYYR 224 

KAC VC + GK LEE+G G G G+ IY + 

Sbjct: 1690 KACCVCRCRATCIAGKPLEERGGG-RGEGERQMQI YIK 1726 



y<7& 7 

gi | 1644455 (U72520) mena protein [Mus musculus] 
Length =541 

Score =34,0 bits (76), Expect = 0.77 

Identities = 14/23 (60%), Positives = 15/23 (64%) 

Frame = +1 

Query: 31 PPPPCTCPAGRHRVSALPPPAGP 99 

PPPP P+G SALPPP GP 

Sbjct: 284 PPPPPPLPSGPAYASALPPPPGP 306 
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1 

SEQUENCE LISTING 



<110> DEVGEN NV 

<120> UNC-5 constructs and screening methods 

<130> SCB/52877/002 

<140> 
<141> 

<160> 96 

<170> Patentln Ver. 2.0 



<210> 1 
<211> 1393 
<212> DNA 

<213> Homo sapiens 



<400> 1 

ttgtttgtgt 

ctcaatgggg 

cccccagacc 

gtctcagaca 

atcaaagtgt 

tccaagctgt 

aaccagagtc 

ctgggaggtc 

attccccaag 

ccacccatgg 

gctctgctca 

gactggaaaa 

ccaggatgcc 

agaacctaag 

tatcgcccat 

ccatgtttgg 

cctgaacaca 

gatcttccag 

ggatcctgcg 

tatccggcag 

gatgctggcc 

cccaactggc 

catgctggca 

agaagggcag 



atcggaagaa 
gctttcagcc 
tcacgtcagc 
aaatcccaat 
acaacacctc 
cccctcagat 
tagcaaggca 
accttattgt 
ggagagtcta 
atgactctca 
cccgccccgt 
tactgctcct 
ctgaaggaaa 
gctcttcatt 
tccctctgga 
agtggatctc 
gtggagctgg 
ctcaactgca 
aacaccatca 
aagctctgta 
cataagctga 
gtaatcctgg 
gctgtcttgg 
tat 



tcatcgtgac 
tgtgaacatc 
tgcagccatg 
gaccaactct 
aagtgctgtc 
gacccagtcg 
gactgatcca 
tcccaattca 
cgaaatgtat 
gacacttttg 
cgtcctcact 
cgctggagta 
ttttacatct 
ttaaaggcag 
agagcaaatt 
aaagaaacct 
tttgcaaact 
ccgtgtcaga 
ccacggtcac 
gcagcctgga 
acctggacag 
atctttggga 
aagaaatggg 



tttgagtcag 
aaggcagcaa 
tacagaggac 
ccaattctgg 
accccccaag 
ttgttggaga 
tcctgtaccg 
ggagtcagct 
gtgactgtac 
acccctgtgg 
atgcatcact 
cagcatccga 
tgagagacag 
cacccacaac 
gctggctaaa 
gcactgcacc 
ctgtgtgcgg 
ggaacctact 
ggggcccagt 
tgccccccag 
gtacttgaat 
agcacagaac 
aagacatgaa 



atattattga 
gacaagatct 
ctgtctatgc 
atccactgcc 
atgacctctc 
atgaagccct 
catttggcag 
tgctgattcc 
acaggaaaga 
tgagctgtgg 
gcgcagaccc 
gtctactgtc 
acgggaggac 
ctgcgcctgt 
tatcaggaaa 
ttcactctgg 
caggtggaag 
ggcatcgatt 
gctttcagca 
acgagaggcc 
tactttgcca 
ttcccagatg 
acggtggtgt 



ctcttcggca 
gctggctgta 
cctgcatgac 
caacctgaaa 
tgagtttacg 
cagcctgaag 
cttcaactcg 
cgctggggcc 
aactatgagg 
gcccccagga 
caataccgag 
tggatgacac 
agctcctaga 
caattcacga 
ttccatttta 
aaagatttag 
gagaagggca 
tgccgctgct 
tccctctccc 
atgactggag 
ccaaatccag 
gaaacctgag 
ccttagcagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1393 



<210> 2 
<211> 238 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Leu Phe Val Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp lie lie 
1 5 10 15 

Asp Ser Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn lie Lys Ala 
20 25 30 
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Ala Arg Gin Asp Leu Leu Ala Val Pro Pro Asp Leu Thr Ser Ala Ala 
35 40 ' 45 

Ala Met Tyr Arg Gly Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys 
50 55 60 

lie Pro Met Thr Asn Ser Pro He Leu Asp Pro Leu Pro Asn Leu Lys 
65 70 75 80 

He Lys Val Tyr Asn Thr Ser Ser Ala Val Thr Pro Gin Asp Asp Leu 

85 90 95 

Ser Glu Phe Thr Ser Lys Leu Ser Pro Gin Met Thr Gin Ser Leu Leu 
100 105 110 

Glu Asn Glu Ala Leu Ser Leu Lys Asn Gin Ser Leu Ala Arg Gin Thr 
115 120 125 

Asp Pro Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Glv His 
130 135 140 

Leu He Val Pro Asn Ser Gly Val Ser Leu Leu He Pro Ala Gly Ala 
145 150 155 160 

lie Pro Gin Gly Arg Val Tyr Glu Met Tyr Val Thr Val His Arg Lys 
165 170 175 

Glu Thr Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro 
180 185 190 

Val Val Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val 
195 200 205 

Leu Thr Met His His Cys Ala Asp Pro Asn Thr Glu Asp Trp Lys He 
210 215 220 

Leu Leu Leu Ala Gly Val Gin His Pro Ser Leu Leu Ser Gly 
225 230 235 



<210> 3 
<211> 1438 
<212> DNA 

<213> Homo sapiens 



<400> 3 

ttgtttgtgt 

ctcaatgggg 

cccccagacc 

gtctcagaca 

atcaaagtgt 

gctgartccc 

caggaaagaa 

gagctgtggg 

cgcagacccc 

gtgggaggat 

ggatgcagag 



atcggaagaa 
gctttcagcc 
tcacgtcagc 
aaatcccaat 
acaacacctc 
gctggggcca 
actatgaggc 
cccccaggag 
aataccgagg 
gtggtggtgg 
gcctgccaca 



tcatcgtgac 
tgtgaacatc 
tgcagccatg 
gaccaactct 
aggtgctgtc 
ttccccaagg 
cacccatgga 
ctctgctcac 
actggaaaat 
tcggggagga 
tcctcacaga 



tttgagtcag 
aaggcagcaa 
tacagaggac 
ccaattctgg 
acctattgtt 
gagagtctac 
tgactctcag 
ccgccccgtc 
actgctcaag 
aaacttcacc 
gaacctcagc 



atattattga 
gacaagatct 
ctgtctatgc 
atccactgcc 
cccaattcag 
gaaatgtatg 
acacttttga 
gtcctcacta 
aaccaggcag 
accccctgct 
acctacgccc 



ctcttcggca 
gctggctgta 
cctgcatgac 
caacctgaaa 
gagtcagctt 
tgaccgtaca 
cccctgtggt 
tgcatcactg 
cacagggaca 
acattcagct 
tggtaggaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 
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ttccaccacc 
ctcctcgctg 
ggaaatttta 
tcattttaaa 
ctggaagagc 
atctcaaaga 
qctggtttgc 
ctgcaccgtg 
catcaccacg 
ctgtagcagc 
gctgaacctg 
cctggatctt 
cttggaagaa 



aaagcggctg 
gagtacagca 
catcttgaga 
ggcagcaccc 
aaattgctgg 
aacctgcact 
aaactctgrg 
tcagaggaac 
gtcacggggc 
ctggatgccc 
gacaggtact 
tgggaagcac 
atgggaagac 



caaagcgcct 
tccgagtcta 
gacagacggg 
acaacctgcg 
ctaaatatca 
gcaccttcac 
tgcggcaggt 
ctactggcat 
ccagtgcttt 
cccagacgag 
tgaattactt 
agaacttccc 
atgaaacggt 



caagctggcc 
ctgtctggat 
aggacagctc 
cctgtcaatt 
ggaaattcca 
tctggaaaga 
ggaaggagaa 
cgatttgccg 
cagcatccct 
aggccatgac 
tgccaccaaa 
agatggaaac 
ggtgtcctta 



atctttgggc 
gacacccagg 
ctagaagaac 
cacgatatcg 
ttttaccatg 
tttagcctga 
gggcagatct 
ctgctggatc 
ctccctatcc 
tggaggatgc 
tccagcccaa 
ctgagcatgc 
gcagcagaag 



<210> 4 
<211> 130 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Leu Phe Val Tyr Arg Lys Asn His Arg Asp Phe 
15 10 



ccctgtgctg 
gtgccctgaa 
ctaaggctct 
cccgttccct 
tttggagtgg 
acacagtgga 
tccagctcaa 
ctgcycidcaC 

ggcagaagct 
tggcccataa 
ctggcgtaat 
tggcagctgt 
ggcagtat 



720 

780 

840 

900 

960 

1020 

1080 

^ -> a n 

1200 
1260 
1320 
1380 
1438 



Glu Ser Asp He He 
15 



Asp Ser 
Ala Arg 



Ala Met 
50 



Ser Ala 
20 

Gin Asp 
35 



Leu Asn Gly Gly 



Leu Leu Ala Val 
40 



Tyr Arg Gly Pro Val Tyr 
55 



Phe Gin Pro 
25 

Pro Pro Asp 
Ala Leu His 



Val Asn 



Leu Thr 
45 

Asp Val 
60 



He Lys Ala 
30 

Ser Ala Ala 



Ser Asp Lys 



He Pro Met Thr Asn Ser Pro lie 
65 70 



Leu Asp Pro 
75 



Leu Pro 



Asn Leu Lys 
80 



He Lys Val Tyr 
Arg Ser 
Leu Arg 



Gin Leu 
100 



Asn Val 
115 



Asn Thr Ser Gly 
85 

Ala Asp Ser Arg 



Cys Asp Cys Thr 
120 



Ala Val Thr 
90 

Trp Gly His 
105 

Gin Glu Arg 



Tyr Cys 



Ser Pro 



Asn Tyr 
125 



Ser Gin Phe 
95 

Arg Glu Ser 
110 

Glu Ala Thr 



His Gly 
130 



<210> 5 
<211> 1575 
<212> DNA 

<213> Homo sapiens 
<400> 5 

ttgtttgtgt atcggaagaa tcatcgtgac tttgagtcag atattattga ctcttcggca 60 
ctcaatgggg gctttcagcc tgtgaacatc aaggcagcaa gacaagacct cacgtcagct 120 
gcagccatgt acagaggacc tgtctatgcc ctgcatgacg tctcagacaa aatcccaatg 180 
accaactctc caattctgga tccactgccc aacctgaaaa tcaaagtgta caacacctca 240 
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ggtgctgtca 
acccagtcgt 
actgatccat 
cccaattcag 
gaaatgtatg 
acacttttga 
gtcctcacta 
aaccaggcag 
accccctgct 
acctacgccc 
atctttgggc 
gacacccagg 
nnnnnnnnnn 
ncgatatggc 
tttaccatgt 
ttagcctgaa 
cagatcttcc 
ctggatcctg 
cctatccggc 
aggatgctgg 
agcccaactg 
agcatgctgg 
gcagaagggc 



ccccccaaga 
tgttggagaa 
cctgtaccgc 
gagtcagctt 
tgactgtaca 
cccctgtggt 
tgcatcactg 
cacagggaca 
acattaagct 
tggtaggaca 
ccctgtgctg 
atgccctgaa 
nnnaggtttt 
ccattccctc 
ttggagtgga 
cacagtggag 
agctcaactg 
cgaacaccat 
agaagctctg 
cccataagct 
gcgtaatcct 
cagctgtctt 
agtat 



tgacctctct 
tgaagccctc 
atttggcagc 
gctgattccc 
caggaaagaa 
gagctgtggg 
cgcagacccc 
gtgggaggat 
ggatgcagag 
ttccaccacc 
ctcctcgctg 
ggaaatttta 
tcatttaaag 
tgaaagagca 
tctcaaagaa 
ctggtttgca 
caccgtgtca 
caccacggtc 
tagcagcctg 
gaacctggac 
ggatctttgg 
ggaagaaatg 



gagtttacgt 
agcctgaaga 
ttcaactcgc 
gctggggcca 
actatgaggc 
cccccaggag 
aataccgagg 
gtggtggtgg 
gcctgccaca 
aaagcggctg 
gagtacagca 
catcttgaga 
cangcanccn 
aattgctggc 
ncctgcactg 
aactctgtgt 
gaggaaccta 
acggggccca 
gatgcccccc 
gggtacttga 
gaagcacaga 
ggaagacatg 



ccaagctgrc 
accagagtct 
tgggaggtca 
ttccccaagg 
cacccatgga 
ctctgctcac 
actggaaaat 
ccggggagga 
tcctcacaga 
caaagcgcct 
tccgagtcta 
gannnnnnnn 
ncaaatgngc 
taaatatcag 
cacnttcact 
cggcaggtgg 
ctggcatcga 
gtgctttcag 
agacgagagg 
attactttgc 
acttcccaga 
aaacggtggt 



ccctcagatg 
agcaaggcag 
ccttattgtt 
gagagtctac 
tgactctcag 
ccqccccgcc 
actgctcaag 
aaacttcacc 
gaacctcagc 
caagctggcc 
ctgtctggat 
nnnnnnnnnn 
ctgtcaattc 
gaaattccat 
ctggaaagat 
aaggagaagg 
tttgccgctg 
catccctctc 
ccatgactgg 
caccaaatcc 
tggaaacctg 
gtccttagca 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1575 



<210> 6 
<211> 526 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Leu Phe Val Tyr Arg Lys Asn His Arg Asp Phe 
1 5 10 



Glu Ser Asp lie lie 
15 



Asp Ser Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn lie Lys Ala 
20 25 30 

Ala Arg Gin Asp Leu Thr Ser Ala Ala Ala Met Tyr Arg Gly Pro Val 
35 40 45 

Tyr Ala Leu His Asp Val Ser Asp Lys lie Pro Met Thr Asn Ser Pro 
50 55 60 

lie Leu Asp Pro Leu Pro Asn Leu Lys lie Lys Val Tyr Asn Thr Ser 
65 70 75 80 

Gly Ala Val Ser Pro Gin Asp Asp Leu Ser Glu Phe Thr Ser Lys Leu 

85 90 95 

Ser Pro Gin Met Thr Gin Ser Leu Leu Glu Asn Glu Ala Leu Ser Leu 
100 105 110 

Lys Asn Gin Ser Leu Ala Arg Gin Thr Asp Pro Ser Cys Thr Ala Phe 
115 120 125 

Gly Ser Phe Asn Ser Leu Gly Gly His Leu lie Val Pro Asn Ser Gly 
130 135 140 



Val Ser Leu Leu lie Pro Ala Gly Ala lie Pro Gin Gly Arg Val Tyr 
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145 150 155 160 

Glu Mer Tyr Val Thr Val His Arg Lys Glu Thr Met Arg Pro Pro Met 
165 170 175 

Asp Asp Ser Gin Thr Leu Leu Thr Pro Val Val Ser Cys Glv Pro Pro 
l«o 185 190 

Gly Ala Leu Leu Thr Arg Pro Val Val Leu Thr Met His His Cys Ala 
195 200 205 

Asp Pro Asn Thr Glu Asp Trp Lys He Leu Leu Lys Asn Gin Ala Ala 
210 215 220 

Gin Glv Gin Trp Glu Asp Val Val Val Val Gly Glu Glu Asn Phe Thr 
225 230 235 240 

Thr Pro Cys Tyr He Lys Leu Asp Ala Glu Ala Cys His He Leu Thr 
245 250 255 

Glu Asn Leu Ser Thr Tyr Ala Leu Val Gly His Ser Thr Thr Lys Ala 
260 265 270 

Ala Ala Lys Arg Leu Lys Leu Ala He Phe Gly Pro Leu Cys Cys Ser 
275 " 280 285 

Ser Leu Glu Tyr Ser He Arg Val Tyr Cys Leu Asp Asp Thr Gin Asp 
290 v 295 " 300 

Ala Leu Lys Glu lie Leu His Leu Glu Arg Gin Thr Gly Gly Gin Leu 
305 310 315 320 

Leu Glu Glu Pro Lys Ala Leu His Phe Lys Gly Ser Thr His Asn Leu 
325 330 335 

Arg Leu Ser lie His Asp lie Ala His Ser Leu Trp Lys Ser Lys Leu 
340 * 345 350 

Leu Ala Lys Tyr Gin Glu lie Pro Phe Tyr His Val Trp Ser Gly Ser 
355 360 365 

Gin Arg Asn Leu His Cys Thr Phe Thr Leu Glu Arg Phe Ser Leu Asn 
370 375 380 

Thr Val Glu Leu Val Cys Lys Leu Cys Val Arg Gin Val Glu Glv Glu 
385 390 395 400 

Gly Gin He Phe Gin Leu Asn Cys Thr Val Ser Glu Glu Pro Thr Gly 
405 410 415 

He Asp Leu Pro Leu Leu Asp Pro Ala Asn Thr lie Thr Thr Val Thr 
420 425 430 

Gly Pro Ser Ala Phe Ser He Pro Leu Pro lie Arg Gin Lys Leu Cys 
435 440 445 

Ser Ser Leu Asp Ala Pro Gin Thr Arg Gly His Asp Trp Arg Met Leu 
450 455 460 
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Aia His Lys Leu Asn Leu Asp Arg Tyr Leu Asn Tyr Phe Ala Thr Lys 
465 470 475 480 

Ser Ser Pro Thr Gly Val lie Leu Asp Leu Trp Glu Ala Gin Asn Phe 
485 490 495 

Pro Asp Gly Asn Leu Ser Met Leu Ala Ala Val Leu Glu Glu Mez Gly 
500 505 510 

Arg His Glu Thr Val Val Ser Leu Ala Ala Glu Gly Gin Tyr 
515 520 525 



<210> 7 
<211> 813 
<212> DNA 

<213> Homo sapiens 



<400> 7 

cagtgacctc 

natcgccgtg 

gaagaaggag 

gcccgtcagc 

cctcagcacc 

ccccaagttc 

actgcaccac 

gaactacttc 

cctcgggggc 

ccataccccg 

aggttgcccc 

tggcgtcctg 

tgacagctgg 

gcacctgggc 



tgtgtacaca 
gccgnntgcc 
gggctggact 
atcaagccca 
accaccacca 
cagctcacca 
agctctccca 
cgctccctgc 
cggctgatga 
agggaagatc 
tagctggctg 
ctcacccggc 
agcctgcgcc 
gaggaggcgc 



ctgcttctgg 
tggtcctgct 
canatgtggc 
gcaaagcaga 
cctaccaggg 
atgggcacct 
cctctgaggc 
cccgaggcac 
tccctaatac 
tatgagatct 
tcagaccctg 
cagtcatcct 
tcaaaaagca 
cctcccactc 



ccctgaggac 
gctgcttgtc 
tgactcgtcc 
caacccccat 
cagtctctgt 
gctcagcccc 
cgaggagttc 
cagcaacatg 
aggaatcaag 
acctcacgct 
ctgagtccca 
ggctatggac 
gtcgtgcgag 
act 



gtggccctct 
ctcatcctcg 
attctcacct 
ctgctcacca 
ccccggcagg 
ctgggtggcg 
gtctcccgcc 
acctatggga 
cctcctcatc 
tgcacaagcc 
tcgttagctt 
cactgtgggg 
ggcagctggg 



atgtgggcct 
tttattgccg 
caggcttcca 
tccagccgga 
atgggcccag 
gccgccacac 
tctccaccca 
ccttcaactt 
cccccagatg 
ggaagacgtg 
gtggaccccc 
agcccagccc 
aggatgtgct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

813 



<210> 8 
<211> 265 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Gin Pro Leu Cys Thr His Cys Phe Trp Pro Gly Arg Gly Pro Leu Cys 
15 10 15 

Gly Pro Xaa Arg Arg Gly Xaa Xaa Pro Gly Pro Ala Ala Ala Cys Pro 
20 25 30 

His Pro Arg Leu Leu Pro Glu Glu Gly Gly Ala Gly Leu Xaa Gys Gly 
35 40 45 

Leu Val His Ser His Leu Arg Leu Pro Ala Arg Gin His Gin Ala Gin 
50 55 60 

Gin Ser Arg Gin Pro Pro Ser Ala His His Pro Ala Gly Pro Gin His 
65 70 75 80 

His His His His Leu Pro Gly Gin Ser Leu Ser Pro Ala Gly Trp Ala 

85 90 95 
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Gin Pro 



Trp Arg 



Val Arg 
130 

Arg His 
145 



Gin Val Pro Ala His Gin Trp Ala Pro Ala Gin 
100 105 

Pro Pro His Thr Ala Pro Gin Leu Ser His Leu 
115 120 125 

Leu Pro Pro Leu His Pro Glu Leu Leu Pro Leu 
135 140 

Gin Gin His Asp Leu Trp Asp Leu Gin Leu Pro 
150 155 



Ala Asp Asp Pro 
Tyr Pro 
Glu Asp 



Glu Gly 
180 



Tyr Arg Asn Gin Ala Ser Ser Ser Pro 
165 170 

Arg Ser Met Arg Ser Thr Ser Arg Leu 
185 



Pro Pro Gly 
110 

Gly Arg Gly 
Pro Ala Pro 



Arg Gly Pro 
160 

Gin Met Pro 
175 

His Lys Pro 
190 



Val Arg 
195 



Leu Pro Leu Ala Gly Cys Gin Thr Leu Leu Ser Pro 
200 205 



He Val 
210 

Pro Gly 
225 

Pro Gin 
Gly Arg 



Ser Leu Trp Thr Pro Trp Arg Pro Ala His Pro Ala Ser His 
215 220 



Tyr Gly Pro Leu Trp Gly Ala Gin Pro Gin Leu 
230 235 

Lys Ala Val Val Arg Gly Gin Leu Gly Gly Cys 
245 ^ 250 

Gly Gly Ala Leu Pro Leu Thr 
260 265 



Glu Pro Ala 
240 

Ala Ala Pro 
255 



<210> 9 

<211> 266 

<212> PRT 

<213> Homo sapiens 

<400> 9 

Ser Asp Leu Cys Val His Thr Ala Ser Gly Pro Glu Asp Val Ala Leu 
15 10 15 

Tyr Val Gly Xaa He Ala Val Ala Xaa Cys Leu Val Leu Leu Leu Leu 
20 25 30 

Val Leu He Leu Val Tyr Cys Arg Lys Lys Glu Gly Leu Asp Ser Xaa 
35 40 45 

Val Ala Asp Ser Ser He Leu Thr Ser Gly Phe Gin Pro Val Ser lie 
50 55 60 

Lys Pro Ser Lys Ala Asp Asn Pro His Leu Leu Thr He Gin Pro Asp 
65 " 70 75 80 

Leu Ser Thr Thr Thr Thr Thr Tyr Gin Gly Ser Leu Cys Pro Arg Gin 
85 90 95 
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Asp Gly Pro Ser Pro Lys Phe Gin Leu Thr Asn Gly His Leu Leu Ser 
100 105 110 

Pro Leu Gly Gly Gly Arg His Thr Leu His His Ser Ser Pro Thr Ser 
115 120 125 

Glu Ala Glu Glu Phe Val Ser Arg Leu Ser Thr Gin Asn Tyr Phe Arg 
130 135 140 

Ser Leu Pro Arg Gly Thr Ser Asn Met Thr Tvr Gly Thr Phe Asn Phe 
145 150 155 160 

Leu Gly Gly Arg Leu Met lie Pro Asn Thr Gly lie Lys Pro Pro His 
165 170 175 

Pro Pro Arg Cys His Thr Pro Arg Glu Asp Leu Asp Leu Pro His Ala 
180 185 190 

Cys Thr Ser Arg Lys Thr Gly Cys Pro Leu Ala Val Arg Pro Cys Val 
195 200 205 

Pro Ser Leu Ala Cys Gly Pro Pro Gly Val Leu Leu Thr Arg Pro Val 
210 215 220 

He Leu Ala Met Asp His Cys Gly Glu Pro Ser Pro Asp Ser Trp Ser 
225 230 235 240 

Leu Arg Leu Lys Lys Gin Ser Cys Glu Gly Ser Trp Glu Asp Val Leu 
245 250 255 

His Leu Gly Glu Glu Ala Pro Ser His Ser 
260 265 



<210> 10 
<211> 266 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Val Thr Ser Val Tyr 
1 5 

Met Trp Ala Xaa Ser 
20 

Ser Ser Ser Ser Phe 
35 

Trp Leu Thr Arg Pro 
50 

Ser Pro Ala Lys Gin 
65 

Ser Ala Pro Pro Pro 

85 



Thr Leu Leu Leu Ala 

10 

Pro Trp Pro Xaa Ala 
25 



He Ala Gly Arg Arg 
40 

Phe Ser Pro Gin Ala 
55 

Thr Thr Pro He Cys 
70 

Pro Pro Thr Arg Ala 

90 



Leu Arg Thr Trp Pro Ser 

15 



Tro Ser Cys Cys Cys Leu 
30 



Arg Gly Trp Thr Xaa Met 
45 

Ser Ser Pro Ser Ala Ser 
60 

Ser Pro Ser Ser Arg Thr 
75 80 

Val Ser Val Pro Gly Arg 

95 



BNSDOCID: <WO 0073328A2_l_> 



WO 00/73328 



PCT/EPOO/05108 



Met Giv Pro Ala Pro Ser Ser Ser Ser Pro Met Gly Thr Cys Ser Ala 
100 105 110 

Pro Tro Val Ala Ala Ala Thr His Cys Thr Thr Ala Leu Pro Pro Leu 
115 120 125 

Arg Pro Arg Ser Ser Ser Pro Ala Ser Pro Pro Arg xnr Thr Ser Ala 
130 135 140 

Pro Cvs Pro Glu Ala Pro Ala Thr Pro Met Gly Pro Ser Thr Ser Ser 
145 150 155 160 

Gly Ala Gly Ser Leu lie Gin Glu Ser Ser Leu Leu lie Pro Pro Asp 
165 170 175 

Ala lie Pro Arg Gly Lys lie Tyr Glu lie Tyr Leu Thr Leu Ala Gin 
180 185 190 

Ala Giv Arg Arg Glu Val Ala Pro Ser Trp Leu Ser Asp Pro Ala Glu 
195 200 205 

Ser His Arg Leu Val Asp Pro Leu Ala Ser Cys Ser Pro Gly Gin Ser 
210 ' 215 220 

Ser Trp Leu Trp Thr Thr Val Gly Ser Pro Ala Leu Thr Ala Gly Ala 
225 230 235 240 

Cys Ala Ser Lys Ser Ser Arg Ala Arg Ala Ala Gly Arg Met Cys Cys 
245 250 255 

Thr Trp Ala Arg Arg Arg Pro Pro Thr His 
260 265 



<210> 11 
<211> 6981 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 11 

atgtcgtcgg tggatgacga gcatcatcct tcaattgaag ttgatattcc tgacgtttat 60 

gaaagzgcat ctgactgtga aaaaactgaa gactgcacct atctaggatc tattctcaag 120 

gcgaaaaaat cattgcgaaa aaccaagaaa gtcgctagtc ctggtatcta tccgggctcg 180 

gcgggtgctc tttcatctct ttcaaatggc ctagccgatg agaatagatc gccaactagt 240 

gctgaagttt ctgattcagt atgtggacaa aaatctataa attcagttga cttgaggttc 300 

agaggattga gagatgaacg tgaactcgtg caaaagaaaa catttacaaa atgggtcaac 360 

tcacacttgg ttcgagtcag ctgcaaagta caagacctct acatggatat gcgagacgga 420 

aaaatgctcc tccgactcct cgcagttctt tccggtgaac gtctgccgaa accgacgccg 480 

ggaaaaatgc gtatccactg tcttgagaac gttgagaaag gtttgcaatt cttgcgcaat 540 

caacacgtgc atctcgaaaa cttgggttca cacgatatcg ttgacggaaa ttcacgcctc 600 

acgttgggtc tcatctggac gatcatcctt cgattccaga tccaagatat cacatttgaa 660 

gacgctgaca atcacgagac acgttcagcg aaggaggcct tgttactgtg gtgccagatg 720 

aaaactgctg gatatccaaa tgtcaacgtc aagaacttct caaccagttg gagagatggt 780 

cttgcattca atgcacttat tcataaacat agacctgatc tcgtagacta cgataattta 840 

caaaaatcga atgctcttta caatttgcaa tctgcttttg ataccgcaga aaatcaattg 900 

ggattggcca agttcctcga tgccgaagat gtgaacgttg accaacctga tgagaaatcc 960 

attatcactt atgtagttac atattatcat tacttcaaca aactgaaaca agataacatc 1020 
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caaggaaaac 
cgatacgaaa 
aacgaacgtc 
gactaccgta 
ctgttcactc 
ggaaaactga 
cgagaacttg 
cgattcaaca 
agtcaggaca 
gccatcgaga 
ggagagcttg 
gttcttcaac 
ttgtcaatgg 
gatattaaga 
ttgcttcaaa 
aattctattg 
aagccagttg 
gagcttcttg 
cagttctggt 
tcgtccactg 
aacgctgaga 
aaagaattgc 
attcgtgatt 
ggtggagtcg 
gatactctcc 
ctcctcaaga 
gttcttcacg 
caacgattgg 
aagcagagac 
gagtcttgga 
gaagaggttg 
gaagctaagg 
aactccgacg 
gacatggtcg 
atcgattgcc 
gacgcgttga 
gaacgcgatt 
atcgaaagag 
caagtctggg 
ggagatcttc 
cagagacaag 
aatcaacacg 
cgtgcaatgg 
caacgtcttg 
caacatttgc 
gaggttatgc 
gagcaagccg 
gacgaaaaga 
gccgctgaca 
gagaaggctc 
tccgactgcg 
acttacagag 
gaacttgctg 
gacgacaagc 
tgggatgagt 
caacaacttt 
gagatgactc 
caacatttga 



gtatcggaaa 
ctctttcatc 
attttgagaa 
ctcaggagaa 
ttcaatcagc 
ttgctgatat 
tcctgaaaga 
gaaaggccga 
atttcggaaa 
cggacatctt 
aagccgagaa 
tctggaacta 
ccattcaaaa 
gcagactttt 
agcacgctct 
ctcaagctca 
aaccaggaac 
atttggctgc 
gggatgttgc 
atactggccg 
acaaccttcg 
aagatgaaag 
acatcaacaa 
aatactacca 
gcgtcatgtc 
agcacgacga 
ccaaggccga 
acacaactct 
ttatcgatgc 
tcgatgagaa 
agatcatgaa 
ttactaacgt 
acattcttca 
atcaaaagag 
aagaaacagt 
ccaacgatct 
tgggagcgat 
aacgaccaca 
atatccttaa 
aacgcttcct 
ttgcctcaga 
ctgccatccg 
gagatcgtgt 
ctggacttca 
tttctcaagg 
tctctcaaca 
agaaccaact 
ttcgcgccgt 
agatccataa 
aagaagtact 
acgagctgag 
acgccaagac 
ctaacaagga 
cagagtacca 
tggaaaagac 
acgtacaaag 
gtgaagacca 
tcgaaactga 



agtgattaat 
tgatttgctc 
taatttggaa 
accaccaaag 
tatgcgtgcc 
caaccgagca 
ggaactcatt 
gatgcgagag 
cgacttgtct 
tgcttacgag 
ctaccatgat 
tcttttccaa 
gatcttccac 
aagcgaggat 
tcttgagtct 
acgcttccga 
cattgatgag 
tgaacgtaag 
tgagcttgaa 
tgatattgtc 
cgacttggag 
catcccagga 
gctcaaggaa 
attcttcacc 
atccgaagac 
tgtccatgac 
gtctcttcca 
caagcagaag 
tctctccctg 
gggaaaactt 
gcatcgtttc 
caatgacttg 
ccgtcaaaac 
aaacgagctc 
tacttggatc 
ttccggtgtc 
tcaagccaag 
agaagctcaa 
caagaaagtt 
ccgtgatttg 
ggaagagcca 
tgaagagatt 
cactcaagat 
agaaggatgg 
actcaacctt 
agaaaactat 
caagagacac 
cggaatgttc 
gaaggctaga 
caagaagctg 
agaatggatc 
tatcacctcc 
acgtcttgac 
cggaaccatt 
tacagaggag 
tattgctgac 
accaggagat 
aatgatcaag 



10 

gagttgatgg 
gaatggatca 
ggtgttcaaa 
ttcgatgaaa 
aacaatcaga 
tggcaatccc 
cgtcaagaga 
acttggctca 
tcggtggaag 
gagcgtgttc 
caagccaaaa 
cttcttttgg 
gatatgcttc 
cttggagccc 
gatattaata 
aatccagatg 
agaagtgatg 
cgtcgtctcg 
catggtatta 
actgtttcgc 
aagtacctcg 
agtgacaaca 
ctctctgctt 
gacgccgacg 
gttggaaaag 
gagctccaga 
caggaagcca 
gccgaactcg 
tacaagttgt 
ttggcaacat 
gatactcttg 
gctcgtcaat 
aagctcaacg 
gagagagctc 
gaggacaaga 
atgaaacttc 
ctcgactcgc 
gccattcgtg 
cgtgagcacg 
gatcacttcc 
caatccttgg 
gacggatacg 
caaaccgatc 
gaagagttgc 
caaatgttcc 
ctcgccaagg 
caagacttca 
ggtgatcagc 
aacattgatg 
aaggacgcct 
gaggagaaga 
aaattcgtcc 
caacttaaac 
gatccacaga 
aagggacaga 
atgaaggaat 
cttaccactg 
aaggctcaac 



agaacgacaa 
atgccaagat 
gacagqtcac 
agggagagct 
gacctttcgt 
ttgagaaggc 
aactcgaaca 
cagaaaacca 
ctgccaccaa 
aagccgtcgt 
tcaatgagag 
ctcgcagagt 
ttactctcga 
atcttatgga 
ttattggaga 
gaccagatgg 
ttctacaaaa 
aagacaacaa 
aggaacaaga 
atttgttggc 
atcgtcttga 
ttcctccaag 
ctcgcaaaga 
atgtcgatcg 
acgagggaac 
actttgacca 
gagaacatcc 
aaaacctttc 
actcggatgc 
tggtccctgg 
agcaagatat 
tgcttaatgt 
ctcgctgggc 
accgcctcga 
ctcgtgtact 
aaagaagatt 
ttcacaaaga 
aagatatcaa 
aagctaaact 
aagcctggct 
ctgaagctga 
ctgaggatta 
cacaatacat 
agcgaatgtg 
ttcgcgatgc 
acgatatccc 
tcaccactat 
tctgccaaga 
aacgtcgtgg 
tgtctctcca 
tgatccgtgc 
gacatcaagc 
atgctgccat 
tcgaggaact 
agttgtttga 
gggcaaccca 
tcaatgtggc 
acattgatca 



aatgatcaat 
ccaacttctc 
cgagttcaac 
cgaagtactt 
gccgagagaa 
agaacacgaa 
actcgcagcc 
acggcttgtc 
aaagcacgaa 
cgctgttgct 
aaaggagaat 
tcgattggag 
tctcatggat 
cgtagaagat 
acgagttaac 
aagtggatac 
acgttacaag 
gagactttgc 
gcaagttctt 
caaacataag 
tgtttctgga 
gcttgccgag 
aagacttgcc 
ctacttgtat 
tgttcaactt 
acacatcaag 
agatatcaga 
tcaacttcgc 
cgattcagtc 
acgcgatatc 
gaagaatcaa 
ggaacatcca 
tcaacttcgc 
aactttccgc 
cgaggattcg 
gtccatgatg 
agccgatgac 
gagaatccac 
cgatgaagct 
tactgccacc 
gcaacttctc 
caagaagatg 
gttccttcgt 
ggacaacaga 
caaacaagca 
acaatctttg 
ggatgccaac 
tggtcactat 
agccaaccgc 
acaattcttg 
tcaagatgag 
attccaatcc 
caaccttggt 
cgctactcaa 
tgctaaccgc 
acttgagaac 
catgcaaaaa 
actcatggag 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
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atggagccac 
ttggcagttc 
ttggaacgca 
atttctgaga 
catcgtctcc 
atcaaccaga 
gcattcgaga 
aaagacagaa 
ggagaggctg 
aaagacgagt 
gacaaatttg 
aagtcaccac 
ggtcttcaag 
gctcttcatc 
ggaagtcaag 
cagtttgctc 
tgtgatactc 
tcgttgaatg 
gaggcttcta 
atggaaaaga 
ctctccagaa 
caaatcgaaa 
atcggatctc 
gcccgtacat 
gatttgttgt 
gatgtgagcg 
gctcgcgagg 
cactacgcat 
atgatgcgca 
ttcgcccgcg 
tcgaaggaat 
ttcgagaaat 
ttcgaattga 
catattggaa 
gatggagcca 
agtctcgacc 
aatgaactgt 
ccaatggcac 
gtactgtcgc 
gacatgcagc 
gctagtagat 
ttctctcgag 



agctcgagga 
aagaacaact 
agaaggccgc 
gacttgttct 
agaagaacct 
tctgtaataa 
aqaaqattca 
agggagatct 
aagcctggat 
tctcaaccaa 
ctgatacgat 
tcaccgaaca 
atttgtcgaa 
gtgaaatcga 
agaacggaca 
gtgacaccga 
tcattggtca 
aggcctggga 
gacttcttca 
ctcatgctat 
aacatcagaa 
gagatgccgc 
gcgaaagcga 
ctcgtctcat 
tgtggatgga 
gagtcgagtt 
agaacttcaa 
caagtgaaat 
gatgggaaga 
acgctgctgt 
acggtagaaa 
cggcattcgc 
aagaaactca 
gtccttctag 
aacaaggaga 
ggaaagcggc 
cattctacaa 
tgccaggatg 
tgaggttacc 
gatggctcac 
cacagacgct 
gcaagaagta 



acttcaccca 
tcaaagactc 
cttccaattt 
cgctaaagct 
acaacttctt 
tggtcaagag 
agaactgcgc 
tggagaaagc 
gagcgaacaa 
gaaccaaatc 
ccgtgctctt 
gatccaagtt 
ggagcgcaga 
cgatcttctt 
agactacgaa 
aaacatcgga 
cggacacact 
gaatcttctg 
caagttctac 
gcctgatgat 
ctatctcaag 
agagcttcga 
ggttgtcaag 
ggatacttct 
tgaggtgaag 
gctcatgaac 
cgcgtgcatc 
cgagaagaag 
tcgttgggaa 
cgccgaatct 
cctcgaagaa 
acaagaagaa 
acaccgcgag 
gtcaactcca 
agcattcgaa 
taaccgctca 
agatccgaag 
ctctgtcaac 
catcggcgcc 
cgagttgcaa 
accagcagaa 
a 
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gacgagcttg 
caggctccat 
ggtcgcgatg 
caaaatcttg 
tccaatgaaa 
ttgattgacg 
agtgcttggc 
gagaaggctc 
gaattgtaca 
aagaaacacg 
gccaccaagg 
cgtcaagctc 
aagagactcg 
caatggatcg 
cacgtccaaa 
tccgaacgtg 
gacgctccga 
gagttgatgg 
cacgactgcc 
cttggacgtg 
gatattgctg 
gatggttacg 
gcatggaggc 
gatcttttca 
agagaaatga 
aatcatcaat 
tcattgggac 
ctgatcaagt 
tatcttcagc 
tggctcttcg 
accatcaaac 
cgtttcttgg 
gaggagaccg 
gccgcggaaa 
ggaacgctta 
tgggagaaac 
catagagatg 
gtggcatcag 
gagtacctat 
gtcgccacgg 
ggcagtgcca 



aaaacattaa 
tggacgaccg 
ttgatgacga 
gagaaagcct 
tcgacaacca 
aaggacatgc 
aagagttgaa 
atcaatrcrt 
tgatgcaaga 
aacgccttca 
ctcacaagtt 
aaatcgagaa 
aagagacctt 
ccgacaagga 
tgcttcaaga 
ttgccaatgc 
ctatagcatt 
acactcgtgc 
gtgattgctt 
actcaagcag 
caattggaga 
ctggagacaa 
atctccgcgg 
agttcatgaa 
actcacaaga 
cgctcaaggc 
gagatctgct 
tgacaaccga 
tcattctgga 
cccaagaacc 
tcatcaagaa 
cgctcgagaa 
ccaagcgacg 
catctttcgg 
tcagaaagca 
tttacgcagt 
aatctgtgca 
actatcaaaa 
tccaatgcgg 
ggcaagccca 
caaagaagaa 



ggctcatcgt 
acgtaaggca 
gaaattgtgg 
tccagattgt 
cgaaccatgg 
taacggacca 
ggaagccgtc 
gtargarrgc 
cgaaagagga 
atccgatatt 
cgtcgaagag 
gctctacgca 
ggaactttac 
agttgttgca 
gagattccaa 
taatgatgga 
gtggaaggat 
ccaaattctc 
gtcaagaatc 
tgttggtgca 
acaagttgct 
ggctcttgat 
actctgtgac 
tatggtccgt 
acgtccgaag 
tgaaatcgat 
caacagaaaa 
aagagccgaa 
agtgtaccaa 
atatcttatt 
acacgaggcg 
attgacaacg 
tggtccagct 
cgcccaggat 
cacttacgag 
gttgcggcag 
tggagagcca 
gaagaagaac 
tagcgaagaa 
actggaagag 
gggaggattt 



4560 
4620 
4680 
4740 
4800 
4860 
4920 
4 960 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
6981 



<210> 12 
<211> 2326 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 12 

Met Ser Ser Val Asp Asp Glu His His Pro Ser lie Glu Val Asp lie 
15 10 15 

Pro Asp Val Tyr Glu Ser Ala Ser Asp Cys Glu Lys Thr Glu Asp Cys 
20 25 30 



Thr Tyr Leu Gly Ser lie Leu Lys Ala Lys Lys Ser Leu Arg Lys Thr 
35 40 45 
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Lys Lys Val Ala 
50 

Ser Ser Leu Ser 
65 

Ala Glu Val Ser 



Asp Leu Arg Phe 
100 

Lys Thr Phe Thr 
115 

Lys Val Gin Asp 
130 



Arg Leu Leu Ala 
145 



Gly Lys Met Arg 



Phe Leu Arg Asn 
180 

lie Val Asp Gly 
195 . 



lie Leu Arg Phe 
210 

His Glu Thr Arg 
225 

Lys Thr Ala Gly 



Trp Arg Asp Gly 
260 



Asp Leu Val Asp 
275 

Leu Gin Ser Ala 
290 

Phe Leu Asp Ala 
305 

lie lie Thr Tyr 



Gin Asp Asn lie 
340 

Met Glu Asn Asp 



Ser Pro 

Asn Gly 
70 

Asp Ser 
85 

Arg Gly 

Lys Trp 

Leu Tyr 

Val Leu 
150 

lie His 
165 

Gin His 

Asn Ser 

Gin lie 

Ser Ala 
230 

Tyr Pro 
245 

Leu Ala 
Tyr Asp 



Phe Asp 

Glu Asp 
310 

Val Val 
325 

Gin Gly 
Lys Met 



Gly lie 
55 

Leu Ala 

Val Cys 

Leu Arg 

Val Asn 
120 

Met Asp 
135 

Ser Gly 

Cys Leu 

Val His 

Arg Leu 
200 

Gin Asp 
215 

Lys Glu 



Asn Val 



Phe Asn 

Asn Leu 
280 

Thr Ala 
295 

Val Asn 
Thr Tyr 
Lys Arg 
lie Asn 



12 

Tyr Pro 

Asp Glu 

Gly Gin 
90 

Asp Glu 
105 

Ser His 

Met Arg 

Glu Arg 

Glu Asn 
170 

Leu Glu 
185 

Thr Leu 

lie Thr 

Ala Leu 

Asn Val 
250 

Ala Leu 
265 

Gin Lys 

Glu Asn 

Val Asp 

Tyr His 
330 

He Gly 
345 

Arg Tyr 



Gly Ser 
60 

Asn Arg 
75 

Lys Ser 

Arg Glu 

Leu Val 

Asp Gly 
140 

Leu Pro 
155 

Val Glu 

Asn Leu 

Gly Leu 

Phe Glu 
220 

Leu Leu 
235 

Lys Asn 

He His 

Ser Asn 

Gin Leu 
300 

Gin Pro 
315 

Tyr Phe 
Lys Val 
Glu Thr 



Ala Gly 

Ser Pro 

He Asn 

Leu Val 
110 

Arg Val 
125 

Lys Met 

Lys Pro 

Lys Gly 

Gly Ser 
190 

He Trp 
205 

Asp Ala 

Trp Cys 

Phe Ser 

Lys His 
270 

Ala Leu 
285 

Gly Leu 

Asp Glu 

Asn Lys 

He Asn 
350 

Leu Ser 



Ala Leu 

Thr Ser 
80 

Ser Val 
95 

Gin Lys 

Ser Cys 

Leu Leu 

Thr Pro 
160 

Leu Gin 
175 

His Asp 

Thr He 

Asp Asn 

Gin Met 
240 

Thr Ser 
255 

Arg Pro 

Tyr Asn 

Ala Lys 

Lys Ser 
320 

Leu Lys 
335 

Glu Leu 



Ser Asp 
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355 



360 



365 



Leu Leu Glu Trp lie Asn Ala Lys lie Gin Leu Leu Asn Glu Arg His 
370 375 380 

Phe Giu Asn Asn Leu Glu Gly Val Gin Arg Gin Leu Thr Glu Phe Asn 
385 390 395 400 

Asp Tyr Arg Thr Gin Glu Lys Pro Pro Lys Phe Asp Glu Lys Gly Glu 
405 410 415 

Leu Glu Val Leu Leu Phe Thr Leu Gin Ser Ala Met Arg Ala Asn Asn 
420 425 430 

Gin Arg Pro Phe Val Pro Arg Glu Gly Lys Leu lie Ala Asp He Asn 
435 440 445 

Arg Ala Trp Gin Ser Leu Glu Lys Ala Glu His Glu Arg Glu Leu Val 
450 455 460 

Leu Lvs Glu Glu Leu He Arg Gin Glu Lys Leu Glu Gin Leu Ala Ala 
465 470 475 480 

Arg Phe Asn Arg Lys Ala Glu Met Arg Glu Thr Trp Leu Thr Glu Asn 
485 490 495 

Gin Arg Leu Val Ser Gin Asp Asn Phe Gly Asn Asp Leu Ser Ser Val 
500 * 505 510 

Glu Ala Ala Thr Lys Lys His Glu Ala He Glu Thr Asp He Phe Ala 
515 520 525 

Tyr Glu Glu Arg Val Gin Ala Val Val Ala Val Ala Gly Glu Leu Glu 
530 535 540 

Ala Glu Asn Tyr His Asp Gin Ala Lys He Asn Glu Arg Lys Glu Asn 
545 550 555 560 

Val Leu Gin Leu Trp Asn Tyr Leu Phe Gin Leu Leu Leu Ala Arg Arg 
565 ~ 570 575 

Val Arg Leu Glu Leu Ser Met Ala lie Gin Lys lie Phe His Asp Met 
580 585 590 

Leu Leu Thr Leu Asp Leu Met Asp Asp He Lys Ser Arg Leu Leu Ser 
595 600 605 

Glu Asp Leu Gly Ala His Leu Met Asp Val Glu Asp Leu Leu Gin Lys 
610 615 620 

His Ala Leu Leu Glu Ser Asp He Asn He He Gly Glu Arg Val Asn 
625 630 635 640 



Asn Ser lie Ala Gin Ala Gin Arg Phe Arg Asn Pro Asp Gly Pro Asp 
645 " 650 655 

Gly Ser Gly Tyr Lys Pro Val Glu Pro Gly Thr He Asp Glu Arg Ser 

660 665 670 
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£sp Val Leu Gin Lys Arg Tyr Lys Glu Leu Leu Asp Leu Ala Ala Glu 
675 680 685 

Arg Lys Arg Arg Leu Glu Asp Asn Lys Arg Leu Cys Gin Phe Trp Trp 
690 695 700 

Asp Val Ala Glu Leu Glu His Gly He Lys Glu Gin Glu Gin Val Leu 
705 710 715 720 

Ser Ser Thr Asp Thr Gly Arg Asp He Val Thr Val Ser His Leu Leu 
725 730 735 

Ala Lys His Lys Asn Ala Glu Asn Asn Leu Arg Asp Leu Glu Lys Tyr 
740 745 750 

Leu Asp Arg Leu Asp Val Ser Gly Lys Glu Leu Gin Asp Glu Ser lie 
755 760 765 

u ro civ Ser Asp Asn He Pro Pro Arg Leu Ala Glu He Arg Asp Tyr 
770 775 780 

Tie Asn Lys Leu Lys Glu Leu Ser Ala Ser Arg Lys Glu Arg Leu Ala 
785 790 795 800 

Gly Gly Val Glu Tyr Tyr Gin Phe Phe Thr Asp Ala Asp Asp Val Asp 
805 810 815 

Arg Tyr Leu Tyr Asp Thr Leu Arg Val Met Ser Ser Glu Asp Val Gly 
820 825 830 

Lvs Asp Glu Gly Thr Val Gin Leu Leu Leu Lys Lys His Asp Asp Val 
835 840 845 

His Asp Glu Leu Gin Asn Phe Asp Gin His He Lys Val Leu His Ala 
850 855 860 

Lys Ala Glu Ser Leu Pro Gin Glu Ala Arg Glu His Pro Asp He Arg 
865 870 875 880 

Gin Arg Leu Asp Thr Thr Leu Lys Gin Lys Ala Glu Leu Glu Asn Leu 
885 890 895 

Ser Gin Leu Arg Lys Gin Arg Leu He Asp Ala Leu Ser Leu Tyr Lys 
900 905 910 

Leu Tyr Ser Asp Ala Asp Ser Val Glu Ser Trp He Asp Glu Lys Gly 
915 " 920 925 

Lys Leu Leu Ala Thr Leu Val Pro Gly Arg Asp He Glu Glu Val Glu 
930 935 940 

He Met Lys His Arg Phe Asp Thr Leu Glu Gin Asp Met Lys Asn Gin 
945 950 955 960 

Glu Ala Lys Val Thr Asn Val Asn Asp Leu Ala Arg Gin Leu Leu Asn 
965 970 975 
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Val Glu His Pro Asn Ser Asp Asp He Leu His Arg Gin Asn Lys Leu 
980 985 990 

Asn Ala Arg Trp Ala Gin Leu Arg Asp Met Val Asp Gin Lys Arg Asn 
995 1000 1005 

Glu Leu Gl» *rn Ala His Ara Leu Glu Thr Phe Arq He Asp Cys Gin 
1010 " 1015 1020 

Glu Thr Val Thr Trp He Glu Asp Lys Thr Arg Val Leu Glu Asp Ser 
1025 1030 1035 1040 

Asp Ala Leu Thr Asn Asp Leu Ser Gly Val Met Lys Leu Gin Arg Arg 
1045 1050 1055 

Leu Ser Met Met Glu Arg Asp Leu Gly Ala He Gin Ala Lys Leu Asp 
1060 1065 1070 

Ser Leu His Lys Glu Ala Asp Asp He Glu Arg Glu Arg Pro Gin Glu 
1075 1080 1085 

Ala Gin Ala He Arg Glu Asp He Lys Arg He His Gin Val Trp Asp 
1090 1095 HOO 

He Leu Asn Lys Lys Val Arg Glu His Glu Ala Lys Leu Asp Glu Ala 
1105 ' 1110 1H5 H20 

Gly Asp Leu Gin Arg Phe Leu Arg Asp Leu Asp His Phe Gin Ala Trp 
1125 H30 H35 

Leu Thr Ala Thr Gin Arg Gin Val Ala Ser Glu Glu Glu Pro Gin Ser 
1140 1145 H50 . 

Leu Ala Glu Ala Glu Gin Leu Leu Asn Gin His Ala Ala He Arg Glu 
1155 1160 H65 

Glu He Asp Gly Tyr Ala Glu Asp Tyr Lys Lys Met Arg Ala Met Gly 
1170 1175 H80 

Asp Arg Val Thr Gin Asp Gin Thr Asp Pro Gin Tyr Met Phe Leu Arg 
1185 1190 1195 1200 

Gin Arg' Leu Ala Gly Leu Gin Glu Gly Trp Glu Glu Leu Gin Arg Met 
1205 1210 1215 

Trp Asp Asn Arg Gin His Leu Leu Ser Gin Gly Leu Asn Leu Gin Met 
1220 1225 1230 

Phe Leu Arg Asp Ala Lys Gin Ala Glu Val Met Leu Ser Gin Gin Glu 
1235 " ~ 1240 1245 

Asn Tyr Leu Ala Lys Asp Asp He Pro Gin Ser Leu Glu Gin Ala Glu 
1250 4 1255 1260 

Asn Gin Leu Lys Arg His Gin Asp Phe He Thr Thr Met Asp Ala Asn 
1265 A 1270 1275 1280 

Asp Glu Lys lie Arg Ala Val Gly Met Phe Gly Asp Gin Leu Cys Gin 
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1285 1290 1295 

Asp Gly His Tyr Ala Ala Asp Lys lie His Lys Lys Ala Arg Asn He 
1300 1305 1310 

Asp Glu Arg Arg Gly Ala Asn Arg Glu Lys Ala Gin Glu Val Leu Lys 
1315 1320 1325 

Lys Leu Lys Asp Ala Leu Ser Leu Gin Gin Phe Leu Ser Asp Cys Asp 
1330 * 1335 1340 

Glu Leu Arg Glu Trp He Glu Glu Lys Met He Arg Ala Gin Asp Glu 
1345 1350 1355 1360 

Thr Tyr Arg Asp Ala Lys Thr He Thr Ser Lys Phe Val Arg His Gin 
1365 1370 1375 

Ala Phe Gin Ser Glu Leu Ala Ala Asn Lys Glu Arg Leu Asp Gin Leu 
1380 1385 1390 

Lys His Ala Ala He Asn Leu Gly Asp Asp Lys Pro Glu Tyr His Gly 
1395 1400 1405 

Thr lie Asp Pro Gin He Glu Glu Leu Ala Thr Gin Trp Asp Glu Leu 
1410 1415 1420 

Glu Lys Thr Thr Glu Glu Lys Gly Gin Lys Leu Phe Asp Ala Asn Arg 
1425 ' 1430 1435 1440 

Gin Gin Leu Tyr Val Gin Ser lie Ala Asp Met Lys Glu Trp Ala Thr 
1445 1450 1455 

Gin Leu Glu Asn Glu Met Thr Arg Glu Asp Gin Pro Gly Asp Leu Thr 
1460 1465 1470 

Thr Val Asn Val Ala Met Gin Lys Gin His Leu He Glu Thr Glu Met 
1475 1480 1485 

He Lys Lys Ala Gin His He Asp Gin Leu Met Glu Met Glu Pro Gin 
1490 1495 1500 

Leu Glu Glu Leu His Pro Asp Glu Leu Glu Asn lie Lys Ala His Arg 
1505 1510 1515 1520 

Leu Ala Val Gin Glu Gin Leu Gin Arg Leu Gin Ala Pro Leu Asp Asp 
1525 1530 1535 

Arg Arg Lys Ala Leu Glu Arg Lys Lys Ala Ala Phe Gin Phe Gly Arg 
1540 1545 1550 

Asp Val Asp Asp Glu Lys Leu Trp lie Ser Glu Arg Leu Val Leu Ala 
1555 1560 1565 

Lys Ala Gin Asn Leu Gly Glu Ser Leu Pro Asp Cys His Arg Leu Gin 
1570 1575 1580 

Lys Asn Leu Gin Leu Leu Ser Asn Glu lie Asp Asn His Glu Pro Trp 
1585 1590 1595 1600 
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He Asn Gin He Cys Asn Asn Gly Gin Glu Leu He Asp Glu Gly His 
1605 1610 1615 

Ala Asn Gly Pro Ala Phe Glu Lys Lys He Gin Glu Leu Arg Ser Ala 
1620 1625 1630 

Trp Gin Glu Leu Lys Glu Ala Val Lys Asp Arg Lys Gly Asp Leu Gly 
1635 1640 1645 

Glu Ser Glu Lys Ala His Gin Phe Leu Tyr Asp Cys Gly Glu Ala Glu 
1650 1655 1660 

Ala Tro Met Ser Glu Gin Glu Leu Tyr Met Met Gin Asp Glu Arg Gly 
1665 ' 1670 1675 1680 

Lys Aso Glu Phe Ser Thr Lys Asn Gin He Lys Lys His Glu Arg Leu 
1685 1690 1695 

Gin Ser Asp lie Asp Lys Phe Ala Asp Thr He Arg Ala Leu Ala Thr 
1700 1705 1710 

Lys Ala His Lys Phe Val Glu Glu Lys Ser Pro Leu Thr Glu Gin He 
1715 1720 1725 

Gin Val Arg Gin Ala Gin He Glu Lys Leu Tyr Ala Gly Leu Gin Asp 
1730 1735 1740 

Leu Ser Lys Glu Arg Arg Lys Arg Leu Glu Glu Thr Leu Glu Leu Tyr 
1745 ~ 1750 1755 1760 

Ala Leu His Arg Glu He Asp Asp Leu Leu Gin Trp He Ala Asp Lys 
1765 1770 1775 

Glu Val Val Ala Gly Ser Gin Glu Asn Gly Gin Asp Tyr Glu His Val 
1780 1785 1790 

Gin Met Leu Gin Glu Arg Phe Gin Gin Phe Ala Arg Asp Thr Glu Asn 
1795 1800 1805 

He Gly Ser Glu Arg Val Ala Asn Ala Asn Asp Gly Cys Asp Thr Leu 
1810 " 1815 1820 

He Gly His Gly His Thr Asp Ala Pro Thr He Ala Leu Trp Lys Asp 
1825 ^ 1830 1835 1840 

Ser Leu Asn Glu Ala Trp Glu Asn Leu Leu Glu Leu Met Asp Thr Arg 
1845 * 1850 1855 

Ala Gin He Leu Glu Ala Ser Arg Leu Leu His Lys Phe Tyr His Asp 
1860 1865 1870 

Cys Arg Asp Cys Leu Ser Arg He Met Glu Lys Thr His Ala Met Pro 
1875 1880 1885 

Asp Asp Leu Gly Arg Asp Ser Ser Ser Val Gly Ala Leu Ser Arg Lys 
1390 ~ " 1895 1900 
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His Gin Asn Tyr Leu Lys Asp He Ala Ala lie Gly Glu Gin Val Ala 
1905 1910 1915 1920 

Gin lie Glu Arg Asp Ala Ala Glu Leu Arg Asp Gly Tyr Ala Gly Asp 
!925 1930 1935 

Lys Ala Leu Asp He Gly Ser Arg Glu Ser Glu Val Val Lys Ala Trp 
1940 1945 1950 

Arg His Leu Arg Gly Leu Cys Asp Ala Arg Thr Ser Arg Leu Met Asp 
1955 I960 1965 

Thr Ser Asp Leu Phe Lys Phe Met Asn Met Val Arg Asp Leu Leu Leu 
1970 1975 1980 

Trp Met Asp Glu Val Lys Arg Glu Met Asn Ser Gin Glu Arg Pro Lys 
1985 1990 1995 2000 

Asp Val Ser Gly Val Glu Leu Leu Met Asn Asn His Gin Ser Leu Lys 
2005 2010 2015 

Ala Glu He Asp Ala Arg Glu Glu Asn Phe Asn Ala Cys lie Ser Leu 
2020 2025 2030 

Gly Arg Asp Leu Leu Asn Arg Lys His Tyr Ala Ser Ser Glu lie Glu 
2035 2040 2045 

Lys Lys Leu He Lys Leu Thr Thr Glu Arg Ala Glu Met Met Arg Arg 
2050 * 2055 2060 

Trp Glu Asp Arg Trp Glu Tyr Leu Gin Leu He Leu Glu Val Tyr Gin 
2065 2070 2075 2080 

Phe Ala Arg Asp Ala Ala Val Ala Glu Ser Trp Leu Phe Ala Gin Glu 
2085 2090 2095 

Pro Tyr Leu He Ser Lys Glu Tyr Gly Arg Asn Leu Glu Glu Thr He 
2100 2105 2110 

Lys Leu He Lys Lys His Glu Ala Phe Glu Lys Ser Ala Phe Ala Gin 
2115 " 2120 2125 

Glu Glu Arg Phe Leu Ala Leu Glu Lys Leu Thr Thr Phe Glu Leu Lys 
2130 2135 2140 

Glu Thr. Gin His Arg Glu Glu Glu Thr Ala Lys Arg Arg Gly Pro Ala 
2145 2150 2155 2160 

His lie Gly Ser Pro Ser Arg Ser Thr Pro Ala Ala Glu Thr Ser Phe 
2165 2170 2175 

Gly Ala Gin Asp Asp Gly Ala Lys Gin Gly Glu Ala Phe Glu Gly Thr 
2180 2185 2190 

Leu lie Arg Lys His Thr Tyr Glu Ser Leu Asp Arg Lys Ala Ala Asn 
2195 2200 2205 

Arg Ser Trp Glu Lys Leu Tyr Ala Val Leu Arg Gin Asn Glu Leu Ser 
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221G 2215 2220 

Phe Tyr Lys Asp Pro Lys His Arg Asp Glu Ser Val His Gly Glu Pro 
2225 2230 2235 2240 

Pro Met Ala Leu Pro Gly Cys Ser Val Asn Val Ala Ser Asp Tyr Gin 
2245 * 2250 2255 

Lys Lys Lys Asn Val Leu Ser Leu Arg Leu Pro lie Gly Ala Glu Tyr 
2260 2265 2270 

Leu Phe Gin Cys Gly Ser Glu Glu Asp Met Gin Arg Trp Leu Thr Glu 
2275 ' " 2280 2285 

Leu Gin Val Ala Thr Gly Gin Ala Gin Leu Glu Glu Ala Ser Arg Ser 
2290 2295 2300 

Gin Thr Leu Pro Ala Glu Gly Ser Ala Thr Lys Lys Lys Gly Gly Phe 
2305 2310 2315 2320 

Phe Ser Arg Gly Lys Lys 
2325 



<210> 13 
<211> 2192 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 13 

gaacttgctg 

gacgacaagc 

tgggatgagt 

caacaacttt 

gagatgactc 

caacatttga 

atggagccac 

ttggcagttc 

ttggaacgca 

atttctgaga 

catcgtctcc 

atcaaccaga 

gcattcgaga 

aaagacagaa 

ggagaggctg 

aaagacgagt 

gacaaatttg 

aagtcaccac 

ggtcttcaag 

gctcttcatc 

ggaagtcaag 

cagtttgctc 

tgtgatactc 

tcgttgaatg 

gaggcttcta 

atggaaaaga 

ctctccagaa 

caaatcgaaa 



ctaacaagga 
cagagtacca 
tggaaaagac 
acgtacaaag 
gtgaagacca 
tcgaaactga 
agctcgagga 
aagaacaact 
agaaggccgc 
gacttgttct 
agaagaacct 
tctgtaataa 
agaagattca 
agggagatct 
aagcctggat 
tctcaaccaa 
ctgatacgat 
tcaccgaaca 
atttgtcgaa 
gtgaaatcga 
agaacggaca 
gtgacaccga 
tcattggtca 
aggcctggga 
gacttcttca 
ctcatgctat 
aacatcagaa 
gagatgccgc 



acgtcttgac 
cggaaccatt 
tacagaggag 
tattgctgac 
accaggagat 
aatgatcaag 
acttcaccca 
tcaaagactc 
cttccaattt 
cgctaaagct 
acaacttctt 
tggtcaagag 
agaactgcgc 
tggagaaagc 
gagcgaacaa 
gaaccaaatc 
ccgtgctctt 
gatccaagtt 
ggagcgcaga 
cgatcttctt 
agactacgaa 
aaacatcgga 
cggacacact 
gaatcttctg 
caagttctac 
gcctgatgat 
ctatctcaag 
agagcttcga 



caacttaaac 
gatccacaga 
aagggacaga 
atgaaggaat 
cttaccactg 
aaggctcaac 
gacgagcttg 
caggctccat 
ggtcgcgatg 
caaaatcttg 
tccaatgaaa 
ttgattgacg 
agtgcttggc 
gagaaggctc 
gaattgtaca 
aagaaacacg 
gccaccaagg 
cgtcaagctc 
aagagactcg 
caatggatcg 
cacgtccaaa 
tccgaacgtg 
gacgctccga 
gagttgatgg 
cacgactgcc 
cttggacgtg 
gatattgctg 
gatggttacg 



atgctgccat 
tcgaggaact 
agttgtttga 
gggcaaccca 
tcaatgtggc 
acattgatca 
aaaacattaa 
tggacgaccg 
ttgatgacga 
gagaaagcct 
tcgacaacca 
aaggacatgc 
aagagttgaa 
atcaattctt 
tgatgcaaga 
aacgccttca 
ctcacaagtt 
aaatcgagaa 
aagagacctt 
ccgacaagga 
tgcttcaaga 
ttgccaatgc 
ctatagcatt 
acactcgtgc 
gtgattgctt 
actcaagcag 
caattggaga 
ctggagacaa 



caaccttggt 
cgctactcaa 
tgctaaccgc 
acttgagaac 
catgcaaaaa 
actcatggag 
ggctcatcgt 
acgtaaggca 
gaaattgtgg 
tccagattgt 
cgaaccatgg 
taacggacca 
ggaagccgtc 
gtatgattgc 
cgaaagagga 
atccgatatt 
cgtcgaagag 
gctctacgca 
ggaactttac 
agttgttgca 
gagattccaa 
taatgatgga 
gtggaaggat 
ccaaattctc 
gtcaagaatc 
tgttggtgca 
acaagttgct 
ggctcttgat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 



BNSDOCtD: <WO_0073328A2_I_> 



WO 00/73328 



PCT/EP00/05108 



20 

atcggatctc gcgaaagcga ggttgtcaag gcatggaggc atctccgcgg actctgtgac 1740 
gcccgtacat ctcgtctcat ggatacttct gatcttttca agttcatgaa tatggtccgt 1800 
gatttgttgt tgtggatgga tgaggtgaag agagaaatga actcacaaga acgtccgaag 1860 
gatgtgagcg gagtcgagtt gctcatgaac aatcatcaat cgctcaaggc tgaaatcgat 1920 
gctcgcgagg agaacttcaa cgcgtgcatc tcattgggac gagatctgct caacagaaaa 1980 
cactacncat caagtgaaat cgagaagaag ctgatcaagt tgacaaccga aagagccgaa 2040 
atgatgcgca gatgggaaga tcgttgggaa tatcttcagc tcattctgga agtgtaccaa 2100 
ttcgcccgcg acgctgctgt cgccgaatct tggctcttcg cccaagaacc atatcttatt 2160 
tcgaaggaat acggtagaaa cctcgaagaa ac 2192 

<210> 14 
<211> 731 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 14 

Glu Leu Ala Ala Asn Lys Glu Arg Leu Asp Gin Leu Lys His Ala Ala 
1 5 10 15 

lie Asn Leu Gly Asp Asp Lys Pro Glu Tvr His Gly Thr lie Asp Pro 
20 " * " 25 * 30 

Gin He Glu Glu Leu Ala Thr Gin Trp Asp Glu Leu Glu Lys Thr Thr 
35 40 45 

Glu Glu Lys Gly Gin Lys Leu Phe Asp Ala Asn Arg Gin Gin Leu Tyr 
50 55 60 

Val Gin Ser He Ala Asp Met Lys Glu Trp Ala Thr Gin Leu Glu Asn 
65 70 75 80 

Glu Met Thr Arg Glu Asp Gin Pro Gly Asp Leu Thr Thr Val Asn Val 
85 90 95 

Ala Met Gin Lys Gin His Leu He Glu Thr Glu Met He Lys Lys Ala 
100 105 110 

Gin His lie Asp Gin Leu Met Glu Met Glu Pro Gin Leu Glu Glu Leu 
115 120 125 

His Pro Asp Glu Leu Glu Asn He Lys Ala His Arg Leu Ala Val Gin 
130 135 " 140 

Glu Gin Leu Gin Arg Leu Gin Ala Pro Leu Asp Asp Arg Arg Lys Ala 
145 150 155 160 

Leu Glu Arg Lys Lys Ala Ala Phe Gin Phe Gly Arg Asp Val Asp Asp 
165 170 175 

Glu Lys Leu Trp He Ser Glu Arg Leu Val Leu Ala Lys Ala Gin Asn 
180 185 190 

Leu Gly Glu Ser Leu Pro Asp Cys His Arg Leu Gin Lys Asn Leu Gin 
195 * 200 205 

Leu Leu Ser Asn Glu lie Asp Asn His Glu Pro Trp lie Asn Gin lie 
210 215 220 
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Cys Asn Asn Gly Gin Glu Leu He Asp Glu Gly His Ala Asn Gly Pro 
225 ^ 230 235 240 

Ala Phe Glu Lys Lys He Gin Glu Leu Arg Ser Ala Trp Gin Glu Leu 
245 250 255 

L"s Glu a la w el 1^5 »sn Arn T,v.e; Glv Asd Leu Gly Glu Ser Glu Lys 
260 ' " 265 270 

Ala His Gin Phe Leu Tyr Asp Cys Gly Glu Ala Glu Ala Trp Met Ser 
275 280 285 

Glu Gin Glu Leu Tyr Met Met Gin Asp Glu Arg Gly Lys Asp Glu Phe 
290 295 300 

Ser Thr Lys Asn Gin He Lys Lys His Glu Arg Leu Gin Ser Asp He 
305 ' 310 315 320 

Asp Lys Phe Ala Asp Thr He Arg Ala Leu Ala Thr Lys Ala His Lys 
325 330 335 

Phe Val Glu Glu Lys Ser Pro Leu Thr Glu Gin He Gin Val Arg Gin 
340 345 350 

Ala Gin He Glu Lys Leu Tyr Ala Gly Leu Gin Asp Leu Ser Lys Glu 
355 " 360 365 

Arg Arg Lys Arg Leu Glu Glu Thr Leu Glu Leu Tyr Ala Leu His Arg 
370 375 380 

Glu He Asp Asp Leu Leu Gin Trp lie Ala Asp Lys Glu Val Val Ala 
385 390 395 400 

Gly Ser Gin Glu Asn Gly Gin Asp Tyr Glu His Val Gin Met Leu Gin 
405 410 415 

Glu Arg Phe Gin Gin Phe Ala Arg Asp Thr Glu Asn He Gly Ser Glu 
420 425 430 

Arg Val Ala Asn Ala Asn Asp Gly Cys Asp Thr Leu He Gly His Gly 
435 440 445 

His Thr Asp Ala Pro Thr He Ala Leu Trp Lys Asp Ser Leu Asn Glu 
450 455 460 

Ala Trp Glu Asn Leu Leu Glu Leu Met Asp Thr Arg Ala Gin He Leu 
465 * 470 475 480 

Glu Ala Ser Arg Leu Leu His Lys Phe Tyr His Asp Cys Arg Asp Cys 
485 490 495 

Leu Ser Arg He Met Glu Lys Thr His Ala Met Pro Asp Asp Leu Gly 
500 505 510 

Arg Asp Ser Ser Ser Val Gly Ala Leu Ser Arg Lys His Gin Asn Tyr 
515 520 525 

Leu Lys Asp He Ala Ala He Gly Glu Gin Val Ala Gin He Glu Arg 
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530 535 540 

Asp Ala Ala Glu Leu Arg Asp Gly Tyr Ala Gly Asp Lys Ala Leu Asp 
545 550 555 560 

He Gly Ser Arg Glu Ser Glu Val Val Lys Ala Trp Arg His Leu Arg 
565 570 575 

Gly Leu Cys Asp Ala Arg Thr Ser Arg Leu Met Asp Thr Ser Asp Leu 
580 585 590 

Phe Lys Phe Met Asn Met Val Arg Asp Leu Leu Leu Trp Met Asp Glu 
595 600 605 

Val Lys Arg Glu Met Asn Ser Gin Glu Arg Pro Lys Asp Val Ser Gly 
610 615 620 

Val Glu Leu Leu Met Asn Asn His Gin Ser Leu Lys Ala Glu He Asp 
625 630 635 640 

Ala Arg Glu Glu Asn Phe Asn Ala Cys He Ser Leu Gly Arg Asp Leu 
645 650 655 

Leu Asn Arg Lys His Tyr Ala Ser Ser Glu He Glu Lys Lys Leu He 
660 665 670 

Lys Leu Thr Thr Glu Arg Ala Glu Met Met Arg Arg Trp Glu Asp Arg 
675 680 685 

Trp Glu Tyr Leu Gin Leu lie Leu Glu Val Tyr Gin Phe Ala Arg Asp 
690 695 700 

Ala Ala Val Ala Glu Ser Trp Leu Phe Ala Gin Glu Pro Tyr Leu lie 
705 710 715 720 

Ser Lys Glu Tyr Gly Arg Asn Leu Glu Glu Thr 
725 730 



<210> 15 
<211> 3561 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 15 

atgagctcta 

ggtggaagtg 

cgacacgatg 

acggcgatcg 

aaacgtgata 

atttttgcat 

cttcctgcgt 

ccaccagctt 

tttggagaag 

gttttaattc 

aatattcgaa 

aaacgccggc 

tctccaaata 



gtagtagtga 
gaatctactc 
ttagtgatgt 
aggttgacga 
ttgttccgac 
tgataacgag 
caaactgtgt 
cttattatca 
aatatcgaaa 
ttgaagttca 
aacttattgc 
tatgctcata 
taactcaagt 



tgaaaatgaa 
acaacctcga 
agatgatgaa 
tgctataaca 
cgatgaagat 
tgaatctgat 
tagagaacaa 
tcgactaaat 
agttgctaca 
cacatttgga 
caatgctctt 
tgatggattc 
atacgctgga 



acaacgatac 
gcaggatcat 
gaagagcatt 
gttcttcttt 
gataataaac 
gttaacagaa 
gtatattatc 
gcagctcttc 
gtattaggac 
atcaatgaga 
acgaatttaa 
attcgatgcg 
ctaatcagaa 



acagaacagg 
caaaaagaac 
atgcaagatt 
catctctaca 
ttcgagaact 
aaagacggct 
ttcgaaggaa 
acacaatcgt 
ttgttgaagc 
ccaatcctgg 
catacggtca 
ttgttcgaat 
atctctcatg 



atcaaatacc 
atcgaatgtg 
tcgcgaagat 
tttcgaacac 
tcacgaaaag 
gaaaaaagct 
accaicaaca 
taaggaatca 
gcttgcagaa 
agagcatcga 
aattcactct 
tgtgatcgaa 
gaacgccgat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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rcaggaatga gtgaagctct tcaaccaaca gttcatgcac tttcaattgc tgcagtacat 840 
gcacatacac atagattcga tgttacagca acattatctg ctttgtggaa tcttgctggt 900 
cactcagtag agaataaacg aactatttgt gatactccga attgtttaaa agtattagcc 960 
agccttctat ctcctgatgc aagattcacg tcacttgttg attcagctac tggaattctg 1020 
aagtatgtat cacaatatct cgcgaatact tccacgcatt tagagctaag aagcttgctc 1080 
atcactcgaa tgctcacttt gcrcaagagt gcctcgttca cttgtgttac aaatactcta 1140 
ggagctattg caeaccte^t" t-grcaflagat ccccacatgc agcaaatgat tcgqcaggat 1200 
atggctgcag tacagcaact caacgtactc cgaaactcga atcgtgatga tattcggaca 1260 
gcagtgaagt ctgtgctgaa tactcttaat caaccgtgta gtcatcgtta tggtgatatg 1320 
tcacattctg ttggaggagg agccacaggg atgcaaatgc tttcggagcc acaacttcag 1380 
atgcaaacat ctcatcatgc ttatcatgga actgcgtcac caagattact atcactgaga 1440 
gcaactcgag catcacctgg aaaatatata caaccacagg ctcaacaaca gttgatacag 1500 
actccacaag ttgatcaaag atctagtagt ctgccaagac attttgccgt tcaacgaaat 1560 
ggatttgtga tggcacagtc ctacaatcaa caaatggatc aacatcagca acaacaaatg 1620 
atctatcaat tgcagcaaca acaacaaatt atgacggagg atcaagcgca aatggaacat 1680 
catcaacaaa tcatgtacct tcagcagcaa caacaacaat ttcatcaaat tcaacaacag 1740 
caacaaatgc aaaaagctca ggaagctgat ccagttcctc ctacggatga cgatctagat 1800 
attcccacat ctaccgtaat gggaacacga agtaacagtg aaagaagtct cggctcgatg 1860 
aatcctggaa gtgttatgac aaactggaat tcttctctgg atactgcagc caacagttct 1920 
cgtgctcttt ctcctgtatc atacaatgat attccagctt ctccaacaat gtgtgctcag 1980 
gtgttcaatt tacctaaatc cactgaatcg gaacatcatc aattgacgtc tcaacagcag 2040 
aacactacac attatagcag tggatcagct aacactatga ctcggagtga tggtgcaact 2100 
actgtgccaa tggataacat tattacacca acttatgcta ttttgaatcc aatattggta 2160 
cacgaacaga caccgaacgg gacagttcca cgaaagactt cagaagagtt ggatagtcct 2220 
gatgatgttc tcccaggtcc atctcttgag gaggaagaag gtgattatgc tataatagga 2280 
ggtgccgcac aaaagacaga tgacgagctt ttgactcgtt ctattcaatc tgaaatgcct 2340 
acttcgtcgt ctactccgaa gatgaaagtc tcgcctcgcc ttaatggatt cttctctcca 2400 
actcagaaaa caacatcgtc tcctgcatgg tctcatccag atacctcacc gattccaaag 2460 
tcatcttctc atcgaactca acctaatcgt cgtcaagatg cttctgatgc agatcgtctt 2520 
ttaatggaat caataatgtc agagatgcca aaaagtcgaa ttatatctcc acgacttgct 2580 
ggtacacaac aatatcttga acccgagcca gagagaagaa gtcattcaaa aaacgaggaa 2640 
gccgatcgcc gcgacgcatt caccgcatcg catgagccca gcgatcataa cggcattgat 2700 
gtggcaagag gctcagactg gtctccacaa caacaattac atcgaatgga gtcattggaa 2760 
agccaagcta gtagtgaaga tagttttgga ttaactgctg aagaaccaaa ttcatcaact 2820 
tctggtgctg ctgcaaacac tatgagattt gacgatgaga tcgatgcatc tcttccaatg 2880 
gattgtgtcg atgatgatga ttacgactat acttatgatc atttcgaaga ttatgaagat 2940 
gaggaggatc ctgatgcgac tcagtttgat gatggagttg atgcacaact aactatcgat 3000 
tgttcaatga tttctagtgg aagcggttca tcgcaaagaa atgaaactac gacaacgtca 3060 
cgagattcaa aagctttagc tacttccaca ccaaaaggat ctgcatcttc actcccagga 3120 
gttcgacaag ccacccgagt atcaacgaat ggaaagtcaa gactgcctgt tccaaaaacg 3180 
aatggatcac ttgttgataa gaatcccaag ccaatcatcg cttcccggcg gccacgattg 3240 
ccaccgaagc cgacactttt aaaagataag cactatccag aagaggattc gattgagaat 3300 
cagacgagag atgatacgat atatgtgaat gctccagttg ttgaggcaga acaagaacga 3360 
atttatatga atgcattgaa gcaacagaag aacattgagc aatcaccgtc aattggaaac 3420 
ggatcaccta tagctaaatc tgcaattgta acgccctaca attatcagaa accgcctttc 3480 
actggtcgaa ataatggtga aatgagcaac gagaagagtg tgacaccgaa tcctaaacaa 3540 
atgcttgtaa caatagtcta a 3561 

<210> 16 
<211> 1186 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 16 

Met Ser Ser Ser Ser Ser Asp Glu Asn Glu Thr Thr He His Arg Thr 
1 5 10 15 

Gly Ser Asn Thr Gly Gly Ser Gly He Tyr Ser Gin Pro Arg Ala Gly 
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20 25 30 

Ser Ser Lys Arg Thr Ser Asn Val Arg His Asp Val Ser Asp Val Asp 
35 " 40 45 

Asp Glu Glu Glu His Tyr Ala Arg Phe Arg Glu Asp Thr Ala He Glu 
50 55 60 

Val Asp Asp Ala He Thr Val Leu Leu Ser Ser Leu His Phe Glu His 
65 70 75 80 

Lys Arg Asp lie Val Pro Thr Asp Glu Asp Asp Asn Lys Leu Arg Glu 
* 85 90 95 

Leu His Glu Lys He Phe Ala Leu He Thr Ser Glu Ser Asp Val Asn 
100 105 110 

Arg Lys Arg Arg Leu Lys Lys Ala Leu Pro Ala Ser Asn Cys Val Arg 
115 120 125 

Glu Gin Val Tyr Tyr Leu Arg Arg Lys Pro Ser Thr Pro Pro Ala Ser 
130 ' * 135 140 

Tvr Tyr His Arg Leu Asn Ala Ala Leu His Thr He Val Lys Glu Ser 
145 150 155 160 

Phe Gly Glu Glu Tyr Arg Lys Val Ala Thr Val Leu Gly Leu Val Glu 
165 170 175 

Ala Leu Ala Glu Val Leu He Leu Glu Val His Thr Phe Gly He Asn 
180 185 190 

Glu Thr Asn Pro Gly Glu His Arg Asn He Arg Lys Leu He Ala Asn 
195 ~ 200 205 

Ala Leu Thr Asn Leu Thr Tyr Gly Gin He His Ser Lys Arg Arg Leu 
210 215 220 

Cys Ser Tyr Asp Gly Phe He Arg Cys Val Val Arg He Val He Glu 
225 " 230 235 

Ser Pro Asn He Thr Gin Val Tyr Ala Gly Leu He Arg Asn Leu Ser 
245 250 255 

Trp Asn Ala Asp Ser Gly Met Ser Glu Ala Leu Gin Pro Thr Val His 
260 265 270 

Ala Leu Ser He Ala Ala Val His Ala His Thr His Arg Phe Asp Val 
275 280 285 

Thr Ala Thr Leu Ser Ala Leu Trp Asn Leu Ala Gly His Ser Val Glu 
290 295 300 

Asn Lys Arg Thr He Cys Asp Thr Pro Asn Cys Leu Lys Val Leu Ala 
305 " 310 315 -^u 

Ser Leu Leu Ser Pro Asp Ala Arg Phe Thr Ser Leu Val Asp Ser Ala 
325 330 335 
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Thr Glv He Leu Lys Tyr Val Ser Gin Tyr Leu Ala Asn Thr Ser Thr 
340 345 350 

His Leu Glu Leu Arg Ser Leu Leu He Thr Arg Met Leu Thr Leu Leu 
355 360 365 

Lys Ser Ala Ser Phe Thr Cys Val Thr Asn Thr Leu Gly Ala He Ala 
370 375 380 

Asn Leu He Val Lys Asp Pro His Met Gin Gin Met He Arg Gin Asp 
385 390 395 400 

Met Ala Ala Val Gin Gin Leu Asn Val Leu Arg Asn Ser Asn Arg Asp 
405 410 415 

Asp He Arg Thr Ala Val Lys Ser Val Leu Asn Thr Leu Asn Gin Pro 
420 425 430 

Cys Ser His Arg Tyr Gly Asp Met Ser His Ser Val Gly Gly Gly Ala 
435 440 445 

Thr Gly Met Gin Met Leu Ser Glu Pro Gin Leu Gin Met Gin Thr Ser 
4 50 4 55 4 60 

His His Ala Tyr His Gly Thr Ala Ser Pro Arg Leu Leu Ser Leu Arg 
465 470 475 480 

Ala Thr Arg Ala Ser Pro Gly Lys Tyr He Gin Pro Gin Ala Gin Gin 
485 490 495 

Gin Leu He Gin Thr Pro Gin Val Asp Gin Arg Ser Ser Ser Leu Pro 
500 505 510 

Arg His Phe Ala Val Gin Arg Asn Gly Phe Val Met Ala Gin Ser Tyr 
515 520 525 

Asn Gin Gin Met Asp Gin His Gin Gin Gin Gin Met He Tyr Gin Leu 
530 535 540 

Gin Gin Gin Gin Gin He Met Thr Glu Asp Gin Ala Gin Met Glu His 
545 550 555 560 

His Gin Gin He Met Tyr Leu Gin Gin Gin Gin Gin Gin Phe His Gin 
565 570 575 

He Gin Gin Gin Gin Gin Met Gin Lys Ala Gin Glu Ala Asp Pro Val 
580 585 590 

Pro Pro Thr Asp Asp Asp Leu Asp He Pro Thr Ser Thr Val Met Gly 
595 600 605 

Thr Arg Ser Asn Ser Glu Arg Ser Leu Gly Ser Met Asn Pro Gly Ser 
610 615 620 

Val Met Thr Asn Trp Asn Ser Ser Leu Asp Thr Ala Ala Asn Ser Ser 
625 630 635 640 
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Arg Ala Leu Ser Pro Val Ser Tyr Asn Asp He Pro Ala Ser Pro Thr 
645 650 

Met Cys Ala Gin Val Phe Asn Leu Pro Lys Ser Thr Glu Ser Glu His 
660 665 

H<s Gin Leu Thr Ser Gin Gin Gin Asn Thr Thr His Tyr Ser Ser Gly 
675 680 685 

Ser Ala Asn Thr Met Thr Arg Ser Asp Gly Ala Thr Thr Val Pro Met 

690 695 700 

Asp Asn lie lie Thr Pro Thr Tyr Ala lie Leu Asn Pro He Leu Val 
705 710 715 

Pis Glu Gin Thr Pro Asn Gly Thr Val Pro Arg Lys Thr Ser Glu Glu 
725 730 

Leu Asp Ser Pro Asp Asp Val Leu Pro Gly Pro Ser Leu Glu Glu Glu 

740 745 
Glu Gly Asp Tyr Ala He He Gly Gly Ala Ala Gin Lys Thr Asp Asp 

Glu Leu Leu Thr Arg Ser He Gin Ser Glu Met Pro Thr Ser Ser Ser 

770 775 
Thr Pro Lys Met Lys Val Ser Pro Arg Leu Asn Gly Phe Phe Ser Pro 
785 790 795 

Thr Gin Lys Thr Thr Ser Ser Pro Ala Trp Ser His Pro Asp Thr Ser 
805 aiu 

Pro lie Pro Lys Ser Ser Ser His Arg Thr Gin Pro Asn Arg Arg Gin 

820 82b 
Asp Ala Ser Asp Ala Asp Arg Leu Leu Met Glu Ser He Met Ser Glu 

835 " 840 
Met Pro Lys Ser Arg He He Ser Pro Arg Leu Ala Gly Thr Gin Gin 

850 860 
Tyr Leu Glu Pro Glu Pro Glu Arg Arg Ser His Ser Lys Asn Glu Glu 
865 370 875 

Ala Asp Arg Arg Asp Ala Phe Thr Ala Ser His Glu Pro Ser Asp His 



885 



Asn Gly lie Asp Val Ala Arg Gly Ser Asp Trp Ser Pro Gin Gin Gin 

900 9Ub 
Leu His Arg Met Glu Ser Leu Glu Ser Gin Ala Ser Ser Glu Asp Ser 

915 920 
Phe Gly Leu Thr Ala Glu Glu Pro Asn Ser Ser Thr Ser Gly Ala Ala 

930 935 940 

Ala Asn Thr Met Arg Phe Asp Asp Glu He Asp Ala Ser Leu Pro Met 
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945 950 955 960 

Asp Cys Val Asp Asp Asp Asp Tyr Asp Tyr Thr Tyr Ass His Phe Glu 
965 970 975 

Asp Tyr Glu Asp Glu Glu Asp Pro Asp Ala Thr Gin Phe Asp Asp Gly 
980 985 990 

Val Asp Ala Gin Leu Thr lie Asp Cys Ser Met lie Ser Ser Gly Ser 
995 1000 * 1005 

Gly Ser Ser Gin Arg Asn Glu Thr Thr Thr Thr Ser Arg Asp Ser Lys 
1010 1015 1020 

Ala Leu Ala Thr Ser Thr Pro Lys Gly Ser Ala Ser Ser Leu Pro Gly 
1025 1030 1035 1040 

Val Arg Gin Ala Thr Arg Val Ser Thr Asn Gly Lys Ser Arg Leu Pro 
1045 1050 1055 

Val Pro Lys Thr Asn Gly Ser Leu Val Asp Lys Asn Pro Lys Pro lie 
1060 1065 ^ 1070 

lie Ala Ser Arg Arg Pro Arg Leu Pro Pro Lys Pro Thr Leu Leu Lys 
1075 1080 1085 

Asp Lys His Tyr Pro Glu Glu Asp Ser lie Glu Asn Gin Thr Arg Asp 
1090 1095 1100 

Asp Thr He Tyr Val Asn Ala Pro Val Val Glu Ala Glu Gin Glu Arg 
1105 1110 1115 1120 

He Tyr Met Asn Ala Leu Lys Gin Gin Lys Asn He Glu Gin Ser Pro 
1125 1130 1135 

Ser He Gly Asn Gly Ser Pro He Ala Lys Ser Ala He Val Thr Pro 
1140 1145 1150 

Tyr Asn Tyr Gin Lys Pro Pro Phe Thr Gly Arg Asn Asn Gly Glu Met 
1155 1160 1165 

Ser Asn Glu Lys Ser Val Thr Pro Asn Pro Lys Gin Met Leu Val Thr 
1170 1175 1180 

He Val 
1185 



<210> 17 
<211> 1742 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 17 

atgagctcta gtagtagtga tgaaaatgaa 
ggtggaagtg gaatctactc acaacctcga 
cgacatgatg ttagtgatgt agatgatgaa 
acggcgatcg aggttgacga tgctataaca 



acaacgatac acagaacagg atcaaatacc 60 
gcaggatcat caaaaagaac atcgaatgtg 120 
gaagagcatt atgcaagatt tcgcgaagat 180 
gttcttcttt catctctaca tttcgaacac 240 
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aaacgtgata 
atttttgcat 
cttcctgcgt 
ccaccagctt 
tttggagaag 
gttttaattc 
aatattcgaa 
aaacgccggc 
tctccaaata 
tcaggaatga 
gcacatacac 
cactcagtag 
agccttctat 
aagtatgtat 
atcactcgaa 
ggagctattg 
atggctgcag 
gcagtgaagt 
tcacattctg 
atgcaaacat 
gcaactcgag 
actccacaag 
ggatttgtga 
atctatcaat 
catcaacaaa 
ca 



ttgttccgac 
tgataacgag 
caaactgtgt 
cttattatca 
aatatcgaaa 
ttgaagttca 
aacttattgc 
tatgctcata 
taactcaagt 
gtgaagctct 
atagattcga 
agaataaacg 
ctcctgatgc 
cacaatatct 
tgctcacttt 
caaacctaat 
tacagcaact 
ctgtgctgaa 
ttggaggagg 
ctcatcatgc 
catcacctgg 
ttgatcaaag 
tggcacagtc 
tgcagcaaca 
tcatgtacct 



cgatgaagat 
tgaatctgat 
tagagaacaa 
tcgactaaat 
agttgctaca 
cacatttgga 
caatgctctt 
tgatggattc 
atacgctgga 
tcaaccaaca 
tgttacagca 
aactatttgt 
aagattcacg 
cgcgaatact 
gctcaagagt 
tgtcaaagat 
caacgtactc 
tactcttaat 
agccacaggg 
ttatcatgga 
aaaatatata 
atctagtagt 
ctacaatcaa 
acaacaaatt 
tcagcagcaa 



28 

gataataaac 
gttaacagaa 
gtatattatc 
gcagctcttc 
gtattaggac 
atcaatgaga 
acgaatttaa 
attcgatgcg 
ctaatcagaa 
gttcatgcac 
acattatctg 
gatactccga 
tcacttgttg 
tccacgcatt 
gcctcgttca 
ccccacatgc 
cgaaactcga 
caaccgtgta 
atgcaaatgc 
actgcgtcac 
caaccacagg 
ctgccaagac 
caaatggatc 
atgacggagg 
caacaacaat 



ttcgagaact 
aaagacggct 
ttcgaaggaa 
acacaatcgt 
ttgttgaagc 
ccaatcctgg 
catacggtca 
ttgttcgaat 
atctctcatg 
tttcaattgc 
ctttgtggaa 
attgtttaaa 
attcagctac 
tagagctaag 
cttgtgttac 
agcaaatgat 
atcgtgatga 
gtcatcgtta 
tttcggagcc 
caagattact 
ctcaacaaca 
attttgccgt 
aacatcagca 
atcaagcgca 
ttcatcaaat 



tcacgaaaag 
gaaaaaagct 
accatcaaca 
taaggaatca 
gcttgcagaa 
agagcatcga 
aattcactct 
tgtgatcgaa 
gaacgccgat 
tgcagtacat 
tcttgctggt 
agtattagcc 
tggaattctg 
aagcttgctc 
aaatactcta 
tcggcaggat 
tattcggaca 
tggtgatatg 
acaacttcag 
atcactgaga 
gttgatacag 
tcaacgaaat 
acaacaaatg 
aatggaacat 
tcaacaacag 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1742 



<210> 18 
<211> 509 
<212> PRT 

<213> Caenorhabditis elegans 



<400> 18 

Met Ser Ser Ser Ser Ser Asp Glu Asn Glu Thr Thr 
15 10 



lie His Arg Thr 
15 



Gly Ser Asn Thr Gly Gly Ser Gly lie Tvr Ser Gin 
20 25 



Pro Arg Ala Gly 
30 



Ser Ser Lys Arg Thr Ser Asn Val Arg His Asp Val 
35 40 



Ser Asp Val Asp 
45 



Asp Glu Glu Glu His Tyr Ala Arg Phe Arg Glu Asp 
50 55 60 



Thr Ala He Glu 



Val Asp Asp Ala He Thr Val Leu Leu Ser Ser Leu 
65 70 75 



His Phe Glu His 
80 



Lys Arg Asp He Val Pro Thr Asp Glu Asp Asp Asn 
85 90 



Lys Leu Arg Glu 
95 



Leu His Glu Lys He Phe Ala Leu He Thr Ser Glu 
100 105 



Ser Asp Val Asn 
110 



Arg Lys Arg Arg Leu Lys Lys Ala Leu Pro Ala Ser 
115 120 



Asn Cys Val Arg 
125 



Glu Gin Val Tyr Tyr Leu Arg Arg Lys Pro Ser Thr Pro Pro Ala Ser 
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130 



135 



140 



Tyr Tyr His Arg Leu Asn Ala Ala Leu His Thr lie Val Lys Glu Ser 
145 150 155 160 

Phe Giy Glu Glu Tyr Arg Lys Val Ala Thr Val Leu Gly Leu Val Glu 
165 170 175 

Ala Leu Ala Glu Val Leu lie Leu Glu Val His Thr Phe Giy He Asn 
180 185 190 

Glu Thr Asn Pro Gly Glu His Arg Asn He Arg Lys Leu He Ala Asn 
195 200 205 

Ala Leu Thr Asn Leu Thr Tyr Gly Gin He His Ser Lys Arg Arg Leu 
210 215 220 

Cys Ser Tyr Asp Gly Phe He Arg Cvs Val Val Arg lie Val He Glu 
225 230 235 240 

Ser Pro Asn lie Thr Gin Val Tyr Ala Gly Leu lie Arg Asn Leu Ser 
245 250 255 

Trp Asn Ala Asp Ser Gly Met Ser Glu Ala Leu Gin Pro Thr Val His 
260 265 270 

Ala Leu Ser lie Ala Ala Val His Ala His Thr His Arg Phe Asp Val 
275 280 285 

Thr Aia Thr Leu Ser Ala Leu Trp Asn Leu Ala Gly His Ser Val Glu 
290 295 300 

Asn Lys Arg Thr lie Cys Asp Thr Pro Asn Cys Leu Lys Val Leu Ala 
305 310 315 320 

Ser Leu Leu Ser Pro Asp Ala Arg Phe Thr Ser Leu Val Asp Ser Ala 
325 ^ " 330 335 

Thr Gly lie Leu Lys Tyr Val Ser Gin Tyr Leu Ala Asn Thr Ser Thr 
340 345 350 

His Leu Glu Leu Arg Ser Leu Leu lie Thr Arg Met Leu Thr Leu Leu 
355 360 365 

Lys Ser Ala Ser Phe Thr Cys Val Thr Asn Thr Leu Gly Ala lie Ala 
370 375 380 

Asn Leu lie Val Lys Asp Pro His Met Gin Gin Met lie Arg Gin Asp 
385 390 395 400 

Met Ala Ala Val Gin Gin Leu Asn Val Leu Arg Asn Ser Asn Arg Asp 
405 410 415 

Asp lie Arg Thr Ala Val Lys Ser Val Leu Asn Thr Leu Asn Gin Pro 
420 ~ 425 430 



Cys Ser His Arg Tyr Gly Asp Met Ser His Ser Val Gly Gly Gly Ala 
435 ' 440 445 
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Thr Glv Met Gin 
450 

Kis His Ala Tyr 
465 

Ala Thr Arg Ala 



Gin Leu He Gin 
500 



Met Leu Ser Glu 
455 

His Gly Thr Ala 
470 



Ser Pro Gly Lys 
485 

Thr Pro Gin Val 



30 

Pro Gin Leu Gin 
460 



Ser Pro Arg Leu 
475 

Tyr He Gin Pro 
490 

Asp Gin Arg Ser 
505 



Met Gin Thr Ser 

Leu Ser Leu Arg 
480 

Gin Ala Gin Gin 
495 

Ser 



<210> 19 

<211> 1998 

<212> DNA 

<213> Caenorhabditis elegans 



<400> 19 

atggtggaac 

gatgctcgaa 

gatccttctg 

aaaaaagaag 

gataatctga 

agtgctccag 

aattcactga 

aatgaaattc 

gcgagacgat 

aagaatgtaa 

aaattagact 

tcgttccgtg 

accaatgatt 

ggtccgagcc 

gaagcatgtt 

gcaccccaaa 

tcagcatcag 

ccagtttggg 

acggatacga 

aaagagctct 

gttgttgaag 

atgagatgct 

acggttacct 

gaatgttcaa 

aactcattgg 

tattctgaaa 

atggtcgaca 

cagttcaaat 

ccaaagagtt 

ccattgtcaa 

tcaccttcaa 

aatttctccc 

cgatgtgtcc 

catcttcttt 

<210> 20 
<211> 665 
<212> PRT 



tgtgcgaact 
t cgctgctct 
tatcaactga 
attttcaatt 
caacttgggg 
atgtttggaa 
ttgccaactt 
gcgaggcaaa 
gctcatctgg 
cagcaccaac 
tcctggcaat 
atgctattat 
ttccgctctt 
acattgaaga 
cattcttcaa 
catcaacaaa 
caaatgaagc 
ctcaaactgc 
atttctcatt 
caattgtcgg 
gcgttgctgc 
taattgaaga 
caaaaggccc 
agactgacaa 
actgctggct 
atgcattcct 
gtttgtctct 
caatgcaaca 
catcgttccc 
caagtcgcat 
cgacatctcg 
tctcagacgg 
gtttaacgac 
tccagtga 



gcacatgatg 
caacttttgt 
ttctttagac 
agcattcgca 
tcggattcgg 
tgtaaaaaga 
tgtttccgga 
tgaagaaata 
aggacaaaat 
ttcaatggca 
gcatcatgag 
taaaatgcag 
ctttcaagaa 
ttggcaatct 
gagcaacacc 
catcgtatca 
atgtcgaacg 
ccaatcaaag 
tgcaccaaca 
attgccgatt 
aatttcaaga 
tggtcttcaa 
agctacaaaa 
cgtcaaagtt 
gtgctatatt 
tctttctgcg 
tttgccagtg 
atggggcgga 
agcaagatta 
cagtatcagc 
tattcgagtt 
agaaaaagtt 
ttctcattca 



cctgaagagg 
ggagaagatt 
tctcaacagt 
gactctggac 
acatctgaac 
agtgatagtg 
gatgctactc 
attcgtaaag 
cagaaacgaa 
atcacatcgt 
atgccatctc 
aaatgtgagc 
gactctcccg 
ctatctgttc 
caattattaa 
aattgcattg 
tgttatcgag 
acggttcttt 
acttcaacga 
tatgcagcaa 
ggggatggaa 
gaaaatgtat 
gatgttcatt 
gaaatcttct 
gtgttgaaag 
agttctgaat 
atcgaagctc 
gcatccagaa 
tcaactgctc 
tcaacaactt 
gcatctataa 
agtgttctat 
aaaatcaaca 



atcaatggcc 
cgatgctctt 
ttcgagagag 
attggcaatc 
cattggatga 
ctcgatcacc 
gattcgtcga 
atcgttggag 
catttgcgga 
ctgataatga 
tctgcgagtc 
cactaccgtc 
actcaggact 
ttctgcccaa 
catcaagtac 
atcgccgtat 
tacgaaggag 
gtgactgcgc 
ctcgccatca 
agcggacact 
gtgatttgct 
cggcatggac 
caattgtaaa 
ttgaggagct 
agaaggtgct 
acagaactct 
gcagtgattc 
tagcttctga 
catcaagaag 
caacacctcg 
tgggtgactt 
caactcgtgg 
atggagttat 



cgatgacatt 
tgaatcagtt 
atgtcaaatg 
aggtataaat 
acgtactgcc 
aaatcgtcca 
tgtaaatgat 
aagagattca 
tattctggaa 
aaagcctccg 
attcaccgct 
gattacttca 
cggttgtagt 
acacgtcgca 
atcaaaaaca 
ctctggtatt 
gatacaccca 
atcaacaccc 
actgcgcgca 
cgttgagaat 
tgtcatcgcc 
aatgattcag 
acaattggaa 
tatcagagaa 
gaaaacattg 
tctttggaga 
agttcaccaa 
ctcaagagtt 
atcaaggata 
atcagctcga 
cacacttgcc 
aggacttgct 
tccgattgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

1998 
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<213> Caenorhabditis elegans 
<400> 20 

Met Val Glu Leu Cys Glu Leu His Met Met Pro Glu Glu Asp Gin Trp 
15 10 15 

Pro Asd lie Aso Ala Arg lie Ala Ala Leu Asn Phe Cys Gly Glu 

20 * 25 30 

Asp Ser Met Leu Phe Glu Ser Val Asp Pro Ser Val Ser Thr Asp Ser 
35 40 45 

Leu Asp Ser Gin Gin Phe Arg Glu Arg Cys Gin Met Lys Lys Glu Asp 
50 55 60 

Phe Gin Leu Ala Phe Ala Asp Ser Gly His Trp Gin Ser Gly lie Asn 
65 70 75 80 

Asp Asn Leu Thr Thr Trp Gly Arg lie Arg Thr Ser Glu Pro Leu Asp 
85 90 95 

Glu Arg Thr Ala Ser Ala Pro Asp Val Trp Asn Val Lys Arg Ser Asp 
100 105 110 

Ser Ala Arg Ser Pro Asn Arg Pro Asn Ser Leu lie Ala Asn Phe Val 
115 120 125 

Ser Gly Asp Ala Thr Arg Phe Val Asp Val Asn Asp Asn Glu lie Arg 
130 135 140 

Glu Ala Asn Glu Glu lie lie Arg Lys Asp Arg Trp Arg Arg Asp Ser 
145 150 155 160 

Ala Arg Arg Cys Ser Ser Gly Gly Gin Asn Gin Lys Arg Thr Phe Ala 
165 170 175 

Asp lie Leu Glu Lys Asn Val Thr Ala Pro Thr Ser Met Ala lie Thr 
180 185 190 

Ser Ser Asp Asn Glu Lys Pro Pro Lys Leu Asp Phe Leu Ala Met His 
195 200 205 

His Glu Met Pro Ser Leu Cys Glu Ser Phe Thr Ala Ser Phe Arg Asp 
210 215 220 

Ala lie lie Lys Met Gin Lys Cys Glu Pro Leu Pro Ser lie Thr Ser 
225 230 235 240 

Thr Asn Asp Phe Pro Leu Phe Phe Gin Glu Asp Ser Pro Asp Ser Gly 
245 250 255 

Leu Gly Cys Ser Gly Pro Ser His lie Glu Asp Trp Gin Ser Leu Ser 
260 265 270 

Val Leu Leu Pro Lys His Val Ala Glu Ala Cys Ser Phe Phe Lys Ser 
275 280 285 

Asn Thr Gin Leu Leu Thr Ser Ser Thr Ser Lys Thr Ala Pro Gin Thr 
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OQC 300 

290 295 



Ser Thr Asn He Val Ser Asn Cys He Asp Arg Arg He Ser Gly lie 
305 310 - 3x5 

Ser Ala Ser Ala Asn Glu Ala Cys Arg Thr Cys Tyr Arg Val Arg Arg 



325 



Arg lie His Pro Pro Val Trp Ala Gin Thr Ala Gin Ser Lys Thr Val 



340 345 

Leu Cys Asp Cys Ala Ser Thr Pro Thr Asp Thr Asn Phe Ser Phe Ala 
355 360 

Pro Thr Thr Ser Thr Thr Arg His Gin Leu Arg Ala Lys Glu Leu Ser 
370 375 380 

lie Val Gly Leu Pro lie Tyr Ala Ala Lys Arg Thr Leu Val Glu Asn 



385 



390 



val val Glu Gly Val Ala Ala He Ser Arg Gly Asp Gly Ser Asp Leu 



405 



Leu Val He Ala Met Arg Cys Leu lie Glu Asp Gly Leu Gin Glu Asn 
420 425 

Val Ser Ala Trp Thr Met He Gin Thr Val Thr Ser Lys Gly Pro Ala 
435 440 44b 

Thr Lys Asp Val His Ser He Val Lys Gin Leu Glu Glu Cys Ser Lys 

Thr Asp Asn Val Lys Val Glu He Phe Phe Glu Glu Leu He Arg Glu 
465 * 470 

Asn Ser Leu Asp Cys Trp Leu Cys Tyr lie Val Leu Lys Glu Lys Val 



485 



Leu Lys Thr Leu Tyr Ser Glu Asn Ala Phe Leu Leu Ser Ala Ser Ser 

500 505 
Glu Tyr Arg Thr Leu Leu Trp Arg Met Val Asp Ser Leu Ser Leu Leu 

Pro Val lie Glu Ala Arg Ser Asp Ser Val His Gin Gin Phe Lys Ser 

530 "5 540 

Met Gin Gin Trp Gly Gly Ala Ser Arg He Ala Ser Asp Ser Arg Val 
545 550 555 

Pro Lys Ser Ser Ser Phe Pro Ala Arg Leu Ser Thr Ala Pro Ser Arg 
565 570 

Arg Ser Arg He Pro Leu Ser Thr Ser Arg He Ser He Ser Ser Thr 

580 585 
Thr Ser Thr Pro Arg Ser Ala Arg Ser Pro Ser Thr Thr Ser Arg He 
595 600 
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Arg Val Ala Ser He Met Gly Asp Phe Thr Leu Ala Asn Phe Ser Leu 
610 615 620 

Ser Asc Gly Glu Lys Val Ser Val Leu Ser Thr Arg Gly Gly Leu Ala 
625 " 630 635 640 

Arg Cvs Val Arg Leu Thr Thr Ser His Ser Lys He Asn Asn Gly Val 
645 650 655 

He Pro He Glu His Leu Leu Phe Gin 
660 665 



<210> 21 
<211> 1839 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 21 

tttcgagaga gatgtcaaat gaaaaaagaa 
cattggcaat caggtataaa tgataatctg 
ccattggatg aacgtactgc cagtgctcca 
gctcgatcac caaatcgtcc aaattcactg 
cgattcgtcg atgtaaatga taatgaaatt 
gatcgtrgga gaagagattc agcgagacga 
acatttgcgg atattctgga aaagaatgta 
tctgataatg aaaagcctcc gaaattagac 
ctctgcgagt cattcaccgc ttcgttccgt 
ccactaccgt cgattacttc aaccaatgat 
gactcaggac tcggttgtag tggtccgagc 
cttctgccca aacacgtcgc agaagcatgt 
acatcaagta catcaaaaac agcaccccaa 
gatcgccgta tctctggtat ttcagcatca 
gtacgaagga ggatacaccc accagtttgg 
tgtgactgcg catcaacacc cacggatacg 
actcgccatc aactgcgcgc aaaagagctc 
aagcggacac tcgttgagaa tgttgttgaa 
agtgatttgc ttgtcatcgc catgagatgc 
tcggcatgga caatgattca gacggttacc 
tcaattgtaa aacaattgga agaatgttca 
tttgaggagc ttatcagaga aaactcattg 
gagaaggtgc tgaaaacatt gtattctgaa 
tacagaactc ttctttggag aatggtcgac 
cgcagtgatt cagttcacca acagttcaaa 
atagcttctg actcaagagt tccaaagagt 
ccatcaagaa gatcaaggat accattgtca 
tcaacacctc gatcagctcg atcaccttca 
atgggtgact tcacacttgc caatttctcc 
tcaacrcgtg gaggacttgc tcgatgtgtc 
aatggagtta ttccgattga acatcttctt 

<210> 22 

<211> 612 

<212> PRT 

<213> Caenorhabditis elegans 

<400> 22 



gattttcaat tagcattcgc agactctgga 60 
acaacttggg gtcggattcg gacatctgaa 120 
gatgtttgga atgtaaaaag aagtgatagt 180 
attgccaact ttgtttccgg agatgctact 2 40 
cgcgaggcaa atgaagaaat aattcgtaaa 300 
tgctcatctg gaggacaaaa tcagaaacga 360 
acagcaccaa cttcaatggc aatcacatcg 420 
ttcctggcaa tgcatcatga gatgccatct 480 
gatgctatta ttaaaatgca gaaatgtgag 540 
tttccgctct tctttcaaga agactctccc 600 
cacattgaag attggcaatc tctatctgtt 660 
tcattcttca agagcaacac ccaattatta 720 
acatcaacaa acatcgtatc aaattgcatt 780 
gcaaatgaag catgtcgaac gtgttatcga 840 
gctcaaactg cccaatcaaa gacggttctt 900 
aatttctcat ttgcaccaac aacttcaacg 960 
tcaattgtcg gattgccgat ttatgcagca 1020 
ggcgttgctg caatttcaag aggggatgga 1080 
ttaattgaag atggtcttca agaaaatgta 1140 
tcaaaaggcc cagctacaaa agatgttcat 1200 
aagactgaca acgtcaaagt tgaaatcttc 1260 
gactgctggc tgtgctatat tgtgttgaaa 1320 
aatgcattcc ttctttctgc gagttctgaa 1380 
agtttgtctc ttttgccagt gatcgaagct 1440 
tcaatgcaac aatggggcgg agcatccaga 1500 
tcatcgttcc cagcaagatt atcaactgct 1560 
acaagtcgca tcagtatcag ctcaacaact 1620 
acgacatctc gtattcgagt tgcatctata 1680 
ctctcagacg gagaaaaagt tagtgttcta 1740 
cgtttaacga cttctcattc aaaaatcaac 1800 
ttccagtga 1839 
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Gin Met Lys Lys 

1 * 
Ma Asp Ser Gly His Trp Gin Ser Gly He Asn Asp Asn Leu Thr Thr 



Phe Arc Glu Arg Cys Gin Met Lys Lys Glu Asp Phe Gin Leu Ala Phe 



20 

Trp Glv Arg He Arg Thr Ser Glu Pro Leu Asp Glu Arg Thr Ala Ser 

35 40 
A la Pro Asp Val Tr P Asn Val Lys Arg Ser Asp Ser Ala Arg Ser Pro 



50 55 
Asn Arg Pro Asn Ser Leu He Ala Asn Phe Val Ser Gly Asp Ala Thr 

65 70 75 

Arg Phe Val Asp Val Asn Asp Asn Glu lie Arg Glu Ala Asn Glu Glu 



85 



lie lie Arg Lys Asp Arg Trp Arg Arg Asp Ser Ala Arg Arg Cys Ser 



100 



Ser Glv Gly Gin Asn Gin Lys Arg Thr Phe Ala Asp lie Leu Glu Lys 

115 120 

Asn Val Thr Ala Pro Thr Ser Met Ala He Thr Ser Ser Asp Asn Glu 

130 135 140 

Lys Pro Pro Lys Leu Asp Phe Leu Ala Met His His Glu Met Pro Ser 

145 150 155 

Leu Cys Glu Ser Phe Thr Ala Ser Phe Arg Asp Ala He He Lys Met 



165 



Gin Lys Cys Glu Pro Leu Pro Ser He Thr Ser Thr Asn Asp Phe Pro 

180 lbb 
Leu Phe Phe Gin Glu Asp Ser Pro Asp Ser Gly Leu Gly Cys Ser Gly 



195 



200 



Pro Ser His He Glu Asp Trp Gin Ser Leu Ser Val Leu Leu Pro Lys 
210 215 2 20 



His val Ala Glu Ala Cys Ser Phe Phe Lys Ser Asn Thr Gin Leu Leu 
225 230 235 

Thr Ser Ser Thr Ser Lys Thr Ala Pro Gin Thr Ser Thr Asn He Val 



245 



Ser Asn Cys He Asp Arg Arg He Ser Gly He Ser Ala Ser Ala Asn 

2 65 



260 



Glu Ala Cys Arg Thr Cys Tyr Arg Val Arg Arg Arg He His Pro Pro 

275 280 
Val Trp Ala Gin Thr Ala Gin Ser Lys Thr Val Leu Cys Asp Cys Ala 

290 295 
Ser Thr Pro Thr Asp Thr Asn Phe Ser Phe Ala Pro Thr Thr Ser Thr 
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305 310 315 320 

Thr Arg His Gin Leu Arg Ala Lys Glu Leu Ser He Val Gly Leu Pro 
325 330 335 

He Tyr Ala Ala Lys Arg Thr Leu Val Glu Asn Val Val Glu Gly Val 
?40 345 350 

Ala Ala He Ser Arg Gly Asp Gly Ser Asp Leu Leu Val He Ala Met 
355 " 360 365 

Arg Cys Leu He Glu Asp Gly Leu Gin Glu Asn Val Ser Ala Trp Thr 
370 375 380 

Met He Gin Thr Val Thr Ser Lys Gly Pro Ala Thr Lys Asp Val His 
385 390 395 400 

Ser He Val Lys Gin Leu Glu Glu Cys Ser Lys Thr Asp Asn Val Lys 
405 410 415 

Val Glu He Phe Phe Glu Glu Leu He Arg Glu Asn Ser Leu Asp Cys 
420 425 430 

Trp Leu Cys Tyr He Val Leu Lys Glu Lys Val Leu Lys Thr Leu Tyr 
435 440 445 

Ser Glu Asn Ala Phe Leu Leu Ser Ala Ser Ser Glu Tyr Arg Thr Leu 
450 455 460 

Leu Trp Arg Met Val Asp Ser Leu Ser Leu Leu Pro Val He Glu Ala 
465 ~ " 470 475 480 

Arg Ser Asp Ser Val His Gin Gin Phe Lys Ser Met Gin Gin Trp Gly 
485 490 495 

Gly Ala Ser Arg lie Ala Ser Asp Ser Arg Val Pro Lys Ser Ser Ser 
500 505 510 

Phe Pro Ala Arg Leu Ser Thr Ala Pro Ser Arg Arg Ser Arg He Pro 
515 " 520 525 

Leu Ser Thr Ser Arg He Ser He Ser Ser Thr Thr Ser Thr Pro Arg 
530 535 540 

Ser Ala Arg Ser Pro Ser Thr Thr Ser Arg He Arg Val Ala Ser He 
545 550 555 560 

Met Glv Asp Phe Thr Leu Ala Asn Phe Ser Leu Ser Asp Gly Glu Lys 
565 570 575 

Val Ser Val Leu Ser Thr Arg Gly Gly Leu Ala Arg Cys Val Arg Leu 
580 585 590 

Thr Thr Ser His Ser Lvs He Asn Asn Gly Val He Pro He Glu His 
595 " 600 605 

Leu Leu Phe Gin 
610 
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<210> 23 
<211> 3435 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 23 

atgcaggcga 

atatatttgt 

gtccccgagc 

ggagtttcag 

ggaagtgagc 

ggcgagattg 

ttgattgaga 

ggtgaaaaag 

tcaactgctc 

tcagaagctc 

agacatcacc 

gatgatatgg 

ggagaacgtg 

gcccaacaca 

attgatccaa 

catattcaat 

aagttcagaa 

actcttgttg 

tttatgcgaa 

cgactattgt 

gatggtttag 

acattgttag 

acaaatcgat 

ctccgttttt 

aaatgggagg 

ggatattgcg 

tctcgatatc 

aaaatcactt 

gatcttacaa 

gtgagcaact 

gcgacagctt 

cctgctattc 

tacgttcttc 

tcaagtcttt 

atgcgtgcat 

aaagtcattc 

ttcacgccga 

gtcacggaaa 

cgtcaaaaat 

ccacgtctta 

gctgaggact 

ctgaataatt 

acgaatgctc 

gcaaatacac 

gtcgaacgaa 

gataaattac 

aattcacttg 

atgttgagca 

gatatgaaaa 

aataaaaaga 



tagatgaatc 
ctgacgagga 
gtgcaactcg 
aacaggaaga 
aaccagagga 
tgctaccaat 
aaattggaag 
atccagaaga 
cacgtcaata 
gtgaaaatcg 
gcaatcgaaa 
tgcttccgag 
aacgtggtcg 
ttctgatgga 
tgagtgttga 
ctctcaaaga 
taaatccgcc 
cacgtgctct 
aaggagctga 
ttgatcaagc 
ctccagttcg 
cactgatgga 
tagatactct 
cccttcctga 
aaaacaaacc 
gtgccgattt 
ctcatatcta 
ctgagcattt 
ttccatcgag 
tgatttcgtt 
caagcgaact 
gtcttcttct 
ccgccattct 
taacagatgg 
cagcaaccgg 
ctgtatcagt 
ttcttttcct 
ttttcagaca 
atttcgagca 
tgcgagatcg 
attatgagat 
gcgaatataa 
tcgaatataa 
ttcgagatgc 
ttgaaacagt 
tcacagaaat 
caaaagaaat 
agcttggtgt 
gtgaagaagg 
aattgctcaa 



caagcgaaat 
agaggaagat 
acgtagcacc 
gtcgccggtt 
aaatctagca 
tgctgaaatt 
ggaagaagag 
agaagaggat 
cagtcttcgc 
acgggctcgt 
tggatctcgt 
acctgataaa 
atttatgccg 
tcgaatgaga 
ttcgtcggtt 
agttgtcctt 
aaaaggtgtc 
tgcgaatgaa 
ttgtttgagt 
atatgccatg 
ttcttcgaaa 
cggattagat 
tgatccagct 
tttgaatgcc 
gattccggaa 
gaaattcttg 
catgtgttct 
tggacatgca 
accattggat 
gagaattcca 
ggaacaagtt 
atgtggatct 
agcaaaactg 
acgtccagaa 
cccatgtatc 
tcaacatatg 
atcgacgctt 
tgcaaactgt 
cgttattgag 
tcgattcgtt 
tatcgaaaca 
tcatgcggac 
tccgtcgaca 
aattgatgat 
ttcacgaatg 
tcccaaaggt 
tgagcaatgg 
agctgctcca 
aacgagcaca 
aaagaagaag 



caaaaagrgc 
tttgctgaag 
agaagacgtt 
cgcaggactc 
gcagatgatc 
gatggaatgg 
gaggaaggcg 
gatagcagca 
cgccgtcaac 
cttgaacatc 
cgaagacgaa 
cgtcaatcga 
atcaatatga 
aaaactgatg 
ggtttcgatc 
ttcccaatgc 
gtattctatg 
tgccgaagag 
aaatgggtcg 
cgtccatcaa 
caagatcaaa 
ggtcgaggag 
ctacgaagac 
cgacgacaaa 
acactcgatg 
tgcacagaag 
gaaaggctga 
atgcggagga 
gaacgaactt 
caaggatatc 
gttagggcat 
gagcaactcg 
gatcatctac 
gaagcatttt 
atgcttcttc 
cttatcacat 
gacacttcgt 
atcactttga 
aagattaaca 
gaatttgtcg 
ccgatttgta 
aaatttgttg 
acaaaagacg 
ctgatcgagt 
ctacaagacg 
tttgcccgaa 
acatcagagc 
acattggaac 
tcaactgatg 
ggtcaaaaga 



cgcctgcaaa 
cggcgcacgt 
catcaatgca 
gcaaagccgc 
cgttgccaat 
ctgaacaaga 
ctgaagaaga 
atgcagaaag 
cagttgtcca 
atcgggttgc 
gcgattcgga 
ggcctcatat 
ccgaaaaaga 
ctggtcaagg 
aagttggagg 
tttatcctga 
ggcccccagg 
gagccaacaa 
gagagtctga 
ttatcttttt 
ttcatgcgag 
aagttgttgt 
caggacgttt 
ttttagatat 
ctattgcaga 
cagttcttat 
aactcgacgt 
tcacacctgc 
caattttact 
gatgcgtgga 
tggaacctaa 
cggacggtgg 
cggtgttctc 
caaatgcaat 
catcaatcga 
gtctggaatc 
ttgaagatgc 
atccttcccg 
ctccaccaaa 
agcccgtgga 
tgcaggatat 
ctgacttgat 
gaaaactgat 
gtgaacttga 
ctggagtaac 
aaaaggcgtg 
gtgaagctga 
tcgttgttgt 
gagttccagc 
aaagcaaaac 



acgaaagcgg 
tgagaacaca 
cgaagaactt 
caagcgcctt 
ggaaggaggt 
aaatgaggac 
tgagcaaagc 
ttcagaggaa 
gttcaatgca 
aaatcaaaac 
ttctgatagt 
gcataaccgt 
attgcagtct 
tgcaagtgac 
actcggtcat 
agtatttgaa 
aaccggaaaa 
agttgcattc 
aagacaactt 
tgatgaaatc 
catagtgagt 
aattggagct 
tgatagggaa 
tcacacttcc 
gagaactagt 
tggattgaga 
tgccaccatt 
ttccagacga 
tggagatact 
aaatgcaatg 
tccaaccgtg 
acaaacgtct 
tctttctgtt 
tcagtcggca 
tgaatggatt 
aatgactggt 
gcctgaatat 
ccgaacaatt 
agtatttgat 
tccggatgag 
tatggaaaag 
attgattcaa 
tcgtcaaatg 
cgaatcgttt 
tccaacctct 
gtctatgacc 
aaatcaaaaa 
ccctgtagaa 
ttctgctgga 
cggcgaatct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 
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gaagagcacg atgaagatag tacagttgaa gacgcaggag aagatacaat tgtagaaaai: 3060 

cttgaaatca agaaaaatca ggaaactcca aacagtgaac atgatatcga gatgaaagat 3120 

gcttctaaag attctacacc ttccgtgcaa atttcgattg ctgagaaaga attgatagrc 3180 

tcgaagccag caacatgcga gctgattcag tgttgtgtcg aaaagtcgga gggttggtca 3240 

gtcagcgagt tggaacgttt gtcttctgtt ctatcacata caattgaacg attccgggac 3300 

gaatggaacc gcgaaaatct accagcccaa cttacccaaa ttgtacgtga atggcaaaca 3360 

gctgacgatt cgaacaatac aattgttaat ggaaccct-ra ^^^^t-cgaa tggaaatctt 3420 

gccaatggac actaa 3435 

<210> 24 
<211> 1144 
<212> ?RT 

<213> Caenorhabditis elegans 
<400> 24 

Met Gin Ala lie Asp Glu Ser Lys Arg Asn Gin Lys Val Pro Pro Ala 
1 5 ~ 10 15 

Lys Arg Lys Arg lie Tyr Leu Ser Asp Glu Glu Glu Glu Asp Phe Ala 
20 25 30 

Glu Ala Ala His Val Glu Asn Thr Val Pro Glu Arg Ala Thr Arg Arg 
35 40 45 

Ser Thr Arg Arg Arg Ser Ser Met His Glu Glu Leu Gly Val Ser Glu 
50 55 60 

Gin Glu Glu Ser Pro Val Arg Arg Thr Arg Lys Ala Ala Lys Arg Leu 
65 70 75 80 

Gly Ser Glu Gin Pro Glu Glu Asn Leu Ala Ala Asp Asp Pro Leu Pro 
85 90 95 

Met Glu Gly Gly Gly Glu lie Val Leu Pro He Ala Glu He Asp Gly 
100 ~ 105 110 

Met Ala Glu Gin Glu Asn Glu Asp Leu He Glu Lys He Gly Arg Glu 
115 120 125 

Glu Glu Glu Glu Gly Ala Glu Glu Asp Glu Gin Ser Gly Glu Lys Asp 
130 135 140 

Pro Glu Glu Glu Glu Asp Asp Ser Ser Asn Ala Glu Ser Ser Glu Glu 
145 150 155 160 

Ser Thr Ala Pro Arg Gin Tyr Ser Leu Arg Arg Arg Gin Pro Val Val 
165 ^ 170 175 

Gin Phe Asn Ala Ser Glu Ala Arg Glu Asn Arg Arg Ala Arg Leu Giu 
180 185 190 

His His Arg Val Ala Asn Gin Asn Arg His His Arg Asn Arg Asn Gly 
195 200 205 

Ser Arg Arg Arg Arg Ser Asp Ser Asp Ser Asp Ser Asp Asp Met Val 
210 215 * 220 

Leu Pro Arg Pro Asp Lys Arg Gin Ser Arg Pro His Met His Asn Arg 
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oo 0 235 240 

225 230 

Gly Glu Arg Glu Arg Gly Arg Phe Met Pro lie Asn Met Thr Glu Lys 
245 250 

Glu Leu Gin Ser Ala Gin His He Leu Met Asp Arg Met Arg Lys Thr 
260 265 

Asp Ala Gly Gin Gly Ala Ser Asp He Asp Pro Met Ser Val Asp Ser 

280 



275 



Ser Val Gly Phe Asp Gin Val Gly Gly Leu Gly His His lie Gin Ser 



290 ■ 295 300 



Leu Lys Glu Val Val Leu Phe Pro Met Leu Tyr Pro Glu Val Phe Glu 
305 310 315 

Lys Phe Arg He Asn Pro Pro Lys Gly Val Val Phe Tyr Gly Pro Pro 



325 



Gly Thr Gly Lys Thr Leu Val Ala Arg Ala Leu Ala Asn Glu Cys Arg 



340 



Arg Gly Ala Asn Lys Val Ala Phe Phe Met Arg Lys Gly Ala Asp Cys 
355 360 365 

Leu Ser Lys Trp Val Gly Glu Ser Glu Arg Gin Leu Arg Leu Leu Phe 
370 375 380 

Asp Gin Ala Tyr Ala Met Arg Pro Ser He He Phe Phe Asp Glu He 
385 390 Jys 

Asp Gly Leu Ala Pro Val Arg Ser Ser Lys Gin Asp Gin He His Ala 
405 410 

Ser lie Val Ser Thr Leu Leu Ala Leu Met Asp Gly Leu Asp Gly Arg 

425 ftJU 



420 



Gly Glu Val Val Val He Gly Ala Thr Asn Arg Leu Asp Thr Leu Asp 
435 440 445 

Pro Ala Leu Arg Arg Pro Gly Arg Phe Asp Arg Glu Leu Arg Phe Ser 

450 4 55 460 

Leu Pro Asp Leu Asn Ala Arg Arg Gin He Leu Asp He His Thr Ser 
465 4 ™ 475 

Lys Trp Glu Glu Asn Lys Pro He Pro Glu Thr Leu Asp Ala He Ala 
485 490 

Glu Arg Thr Ser Gly Tyr Cys Gly Ala Asp Leu Lys Phe Leu Cys Thr 
500 505 3AU 

Glu Ala Val Leu He Gly Leu Arg Ser Arg Tyr Pro His He Tyr Met 
515 520 525 

Cys Ser Glu Arg Leu Lys Leu Asp Val Ala Thr lie Lys He Thr Ser 
530 535 540 
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Glu His Phe Gly His Ala Met Arg Arg He Thr Pro Ala Ser Arg Arg 
545 550 555 560 

Asp Leu Thr He Pro Ser Arg Pro Leu Asp Glu Arg Thr Ser He Leu 
565 570 575 

Leu Gly Asp Thr Val Ser Asn Leu He Ser Leu Arg He Pro Gin Gly 
580 585 590 

Tyr Arg Cys Val Glu Asn Ala Met Ala Thr Ala Ser Ser Glu Leu Glu 
595 600 605 

Gin Val Val Arg Ala Leu Glu Pro Asn Pro Thr Val Pro Ala He Arg 
610 ~ 615 620 

Leu Leu Leu Cys Gly Ser Glu Gin Leu Ala Asp Gly Gly Gin Thr Ser 
625 " 630 635 640 

Tyr Val Leu Pro Ala He Leu Ala Lys Leu Asp His Leu Pro Val Phe 
645 650 655 

Ser Leu Ser Val Ser Ser Leu Leu Thr Asp Gly Arg Pro Glu Glu Ala 
660 665 670 

Phe Ser Asn Ala He Gin Ser Ala Met Arg Ala Ser Ala Thr Gly Pro 
675 680 685 

Cys He Met Leu Leu Pro Ser He Asp Glu Trp He Lys Val He Pro 
690 695 " 700 

Val Ser Val Gin His Met Leu lie Thr Cys Leu Glu Ser Met Thr Gly 
705 710 715 720 

Phe Thr Pro He Leu Phe Leu Ser Thr Leu Asp Thr Ser Phe Glu Asp 
725 730 735 

Ala Pro Glu Tyr Val Thr Glu He Phe Arg His Ala Asn Cys He Thr 
740 745 750 

Leu Asn Pro Ser Arg Arg Thr He Arg Gin Lys Tyr Phe Glu His Val 
755 760 765 

lie Glu Lys lie Asn Thr Pro Pro Lys Val Phe Asp Pro Arg Leu Met 
770 ^ 775 780 

Arg Asp Arg Arg Phe Val Glu Phe Val Glu Pro Val Asp Pro Asp Glu 
785 ~ 790 795 800 

Ala Glu Asp Tyr Tyr Glu He lie Glu Thr Pro He Cys Met Gin Asp 
805 810 815 

He Met Glu Lys Leu Asn Asn Cys Glu Tyr Asn His Ala Asp Lys Phe 
820 825 830 



Val Ala Asp Leu lie Leu lie Gin Thr Asn Ala Leu Glu Tyr Asn Pro 
835 840 845 
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Ser Thr Thr Lys Asp Gly Lys Leu He Arg Gin Met Ala Asn Thr Leu 

850 855 
Arg Asp Ala He Asp Asp Leu He Glu Cys Glu Leu Asp Glu Ser Phe 
8 65 870 

Val Glu Arg He Glu Thr Val Ser Arg Met Leu Gin Asp Ala Gly Val 
885 8yu 

Thr Pro Thr Ser Asp Lys Leu Leu Thr Glu He Pro Lys Gly Phe Ala 
900 90o 

Arg Lys Lys Ala Trp Ser Met Thr Asn Ser Leu Ala Lys Glu He Glu 

915 920 
Gin Trp Thr Ser Glu Arg Glu Ala Glu Asn Gin Lys Met Leu Ser Lys 

930 935 
Leu Gly Val Ala Ala Pro Thr Leu Glu Leu Val Val Val Pro Val Glu 
945 950 9d5 



Asp Met Lys Ser Glu Glu Gly Thr Ser Thr Ser Thr Asp Gly Val Pro 
965 970 

Ala Ser Ala Gly Asn Lys Lys Lys Leu Leu Lys Lys Lys Lys Gly Gin 
980 985 

Lys Lys Ser Lys Thr Gly Glu S.r Glu Glu His Asp Glu Asp Ser Thr 
995 luuu 

Val Glu Asp Ala Gly Glu Asp Thr He Val Glu Asn Leu Glu He Lys 

ioio 1015 10 

Lys Asn Gin Glu Thr Pro Asn Ser Glu His Asp He Glu Met Lys Asp 

1030 1035 



1025 

Ala Ser Lys Asp Ser Thr Pro Ser Val Gin He Ser He Ala Glu Lys 
1045 1UbU 

Glu Leu lie Val Ser Lys Pro Ala Thr Cys Glu Leu He Gin Cys Cys 

1060 106b 
Val Glu Lys Ser Glu Gly Trp Ser Val Ser Glu Leu Glu Arg Leu Ser 

1075 1080 
Ser Val Leu Ser His Thr lie Glu Arg Phe Arg Asp Glu Trp Asn Arg 

1090 1095 1100 

Glu Asn Leu Pro Ala Gin Leu Thr Gin He Val Arg Glu Trp Gin Thr 
1105 H10 ■ 1115 

Ala Asp Asp Ser Asn Asn Thr He Val Asn Gly Thr Leu Asn Lys Ser 
1125 11JU 

Asn Gly Asn Leu Ala Asn Gly His 
1140 
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<210> 25 
<211> 1908 
<212> DNA 

<213> Caenorhabditis eiegans 



<400> 25 

gaagacgaag 

caatacagtc 

aatcgacggg 

cgaaatggat 

ccgagacctg 

ggtcgactta 

atggarcrgaa 

gttgatrcgt 

aaagaagttg 

ccgccaaaag 

gctctrgcga 

gctgaztgtt 

caagcaratg 

gttcgt~ctt 

atggacggat 

actctrgatc 

cctga" tga 

aaaccgattc 

gatttgaaat 

atctacatgt 

cattttggac 

tcgagaccat 

tcgttgagaa 

gaactggaac 

cttctatgtg 

attctagcaa 

gatggacgtc 

accggcccat 

tcagt icaac 

ttcctatcga 

agacargcaa 

gagcacgtta 



ttcgccgccg 
ctcgtcttga 
ctcgtcgaag 
ataaacgtca 
tgccgatcaa 
tgagaaaaac 
cggttggttt 
tccttttccc 
gtgtcgtatt 
atgaatgccg 
tgagtaaatg 
ccatgcgtcc 
cgaaacaaga 
tagatggtcg 
cagctctacg 
atgcccgacg 
cggaaacact 
tcttgtgcac 
gttctgaaag 
atgcaatgcg 
tggatgaacg 
ttccacaagg 
aagttgttag 
gatctgagca 
aactggatca 
cagaagaagc 
gtatcatgct 
atatgcttat 
cgcttgacac 
actgtatcac 
ttgagaagat 



— - -j 

tcaaccagtt 
acatcatcgg 
acgaagcgat 
atcgaggcct 
tatgaccgaa 
tgatgctggt 
cgatcaagtt 
aatgctttat 
ctatgggccc 
aagaggagcc 
ggtcggagag 
atcaattatc 
tcaaattcat 
aggagaagtt 
aagaccagga 
acaaatttta 
cgatgctatt 
agaagcagtt 
gctgaaactc 
gaggatcaca 
aacttcaatt 
atatcgatgc 
ggcattggaa 
actcgcggac 
tctaccggtg 
attttcaaat 
tcttccatca 
cacatgtctg 
ttcgtttgaa 
tttgaatcct 
taacactcca 



gaaagttcaq 
gtccagttca 
gttgcaaatc 
tcggattctg 
catatgcata 
aaagaattgc 
caaggtgcaa 
ggaggactcg 
cctgaagtat 
ccaggaaccg 
aacaaagttg 
tctgaaagac 
ttttttgatg 
gcgagcatag 
gttgtaattg 
cgttttgata 
gatattcaca 
gcagagagaa 
cttattggat 
gacgttgcca 
cctgcttcca 
ttacttggag 
gtggaaaatg 
cctaatccaa 
ggtggacaaa 
ttctctcttt 
gcaattcagt 
atcgatgaat 
gaatcaatga 
gatgcgcctg 
tcccgccgaa 
ccaaaagtat 



aggaatcaac 
atgcatcaga 
aaaacagaca 
atagtgatga 
accgtggaga 
agtctgccca 
gtgacattga 
gtcatcatat 
ttgaaaagtt 
gaaaaactct 
cattctttat 
aacttcgact 
aaatcgatgg 
tgagtacatt 
gagctacaaa 
gggaactccg 
cttccaaatg 
ctagtggata 
tgagatctcg 
ccattaaaat 
gacgagatct 
atactgtgag 
caatggcgac 
ccgtgcctgc 
cgtcttacgt 
ctgtttcaag 
cggcaatgcg 
ggattaaagt 
ctggtttcac 
aatatgtcac 
caattcgtca 
ttgatcca 



tgctccacgn 
agctcgtgaa 
tcaccgcaat 
tatggtgctt 
acgtgaacgt 
acacattcrg 
tccaatgagt 
tcaatctctc 
cagaataaat 
tgttgcacgt 
gcgaaaagga 
attgtttgat 
tttagctcca 
gttagcactg 
tcgattagat 
tttttccctt 
ggaggaaaac 
ttgcggtgcc 
atatcctcat 
cacttctgag 
tacaattcca 
caacttgatt 
agcttcaagc 
tattcgtctt 
tcttcccgcc 
tcttttaaca 
tgcatcagca 
cattcctgta 
gccgattctt 
ggaaattttc 
aaaatatttc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1908 



<210> 26 
<211> 636 
<212> PRT 

<213> Caenorhabditis eiegans 
<400> 26 

Glu GIu Glu Glu Asp Asp Ser Ser Asn Ala Glu 
15 10 



Ser Ser Glu Glu Ser 
15 



Thr Aia Pro Arg Gin Tyr Ser Leu Arg Arg Arg Gin Pro Val Val Gin 
20 ~ 25 30 

Phe Asn Ala Ser Glu Ala Arg Glu Asn Arg Arg Ala Arg Leu Glu His 
35 40 45 

His Arg Val Ala Asn Gin Asn Arg His His Arg Asn Arg Asn Gly Ser 
50 55 60 



Arg Arg Arg Arg Ser Asp Ser Asp Ser Asp Ser Asp Asp Met Val Leu 



BNSOOCID: <WO 0073328A2J_> 



WO 00/73328 PCT7EPOO/05108 

42 

65 70 75 80 

Pro Arg Pro Asp Lys Arg Gin Ser Arg Pro His Met His Asn Arg Gly 
85 90 95 

Glu Arg Glu Arg Gly Arg Phe Met Pro lie Asn Met Thr Glu Lys Glu 
100 105 110 

Leu Gin Ser Ala Gin His lie Leu Met Asp Arg Met Arg Lys Thr Asp 
115 120 125 

Ala Gly Gin Gly Ala Ser Asp lie Asp Pro Met Ser Val Asp Ser Ser 
130 135 140 

Val Gly Phe Asp Gin Val Gly Gly Leu Gly His His He Gin Ser Leu 
145 150 155 160 

Lys Glu Val Val Leu Phe Pro Met Leu Tyr Pro Glu Val Phe Glu Lys 
165 170 175 

Phe Arg lie Asn Pro Pro Lys Gly Val Val Phe Tyr Gly Pro Pro Gly 
180 185 190 

Thr Gly Lys Thr Leu Val Ala Arg Ala Leu Ala Asn Glu Cys Arg Arg 
195 200 205 

Gly Ala Asn Lys Val Ala Phe Phe Met Arg Lys Gly Ala Asp Cys Leu 
210 215 220 

Ser Lys Trp Val Gly Glu Ser Glu Arg Gin Leu Arg Leu Leu Phe Asp 
225 230 235 240 

Gin Ala Tyr Ala Met Arg Pro Ser He He Phe Phe Asp Glu He Asp 
245 250 255 

Gly Leu Ala Pro Val Arg Ser Ser Lys Gin Asp Gin He His Ala Ser 
260 265 270 

He Val Ser Thr Leu Leu Ala Leu Met Asp Gly Leu Asp Gly Arg Gly 
275 280 285 

Glu Val Val Val He Gly Ala Thr Asn Arg Leu Asp Thr Leu Asp Pro 
290 ' 295 " 300 

Ala Leu Arg Arg Pro Gly Arg Phe Asp Arg Glu Leu Arg Phe Ser Leu 
305 310 315 320 

Pro Asp Leu Asn Ala Arg Arg Gin He Leu Asp He His Thr Ser Lys 
325 330 335 

Trp Glu Glu Asn Lys Pro He Pro Glu Thr Leu Asp Ala He Ala Glu 
340 345 350 

Arg Thr Ser Gly Tyr Cys Gly Ala Asp Leu Lys Phe Leu Cys Thr Glu 
355 * 360 " 365 

Ala Val Leu lie Gly Leu Arg Ser Arg Tyr Pro His He Tyr Met Cys 
370 375 380 



BNSDOCID: <WO 0073328A2_L> 



WO 00/73328 



PCT/EP00/05108 



43 

Ser Glu Arg Leu Lys Leu Asp Val Ala Thr He Lys He Thr Ser Giu 
385 390 395 400 

His Phe Gly His Ala Met Arg Arg He Thr Pro Ala Ser Arg Arg Asp 
405 410 415 

Leu Thr He Pro Ser Arg Pro Leu Asp Glu Arg Thr Ser He Leu Leu 
420 425 430 

Gly Asp Thr Val Ser Asn Leu He Ser Leu Arg He Pro Gin Gly Tyr 
435 440 445 

Arg Cys Val Glu Asn Ala Met Ala Thr Ala Ser Ser Glu Leu Glu Gin 
450 455 460 

Val Val Arg Ala Leu Glu Pro Asn Pro Thr Val Pro Ala He Arg Leu 
465 470 475 480 

Leu Leu Cys Gly Ser Glu Gin Leu Ala Asp Gly Gly Giri Thr Ser Tyr 
.. 485 490 495 

Val Leu Pro Ala lie Leu Ala Lys Leu Asp His Leu Pro Val Phe Ser 
500 505 510 

Leu Ser Val Ser Ser Leu Leu Thr Asp Gly Arg Pro Glu Glu Ala Phe 
515 520 525 

Ser Asn Ala He Gin Ser Ala Met Arg Ala Ser Ala Thr Gly Pro Cys 
530 535 540 

He Met Leu Leu Pro Ser He Asp Glu Trp He Lys Val He Pro Val 
545 550 555 560 

Ser Val Gin His Met Leu He Thr Cys Leu Glu Ser Met Thr Gly Phe 
565 570 575 

Thr Pro He Leu Phe Leu Ser Thr Leu Asp Thr Ser Phe Glu Asp Ala 
580 585 590 

Pro Glu Tyr Val Thr Glu He Phe Arg His Ala Asn Cys lie Thr Leu 
595 600 605 

Asn Pro Ser Arg Arg Thr He Arg Gin Lys Tyr Phe Glu His Val lie 
610 615 620 

Glu Lys He Asn Thr Pro Pro Lys Val Phe Asp Pro 
625 630 635 



<210> 27 
<211> 3024 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 27 

atggaacgac acggttcaga attacaagga gaagtggttt tcactacgac gagtactgcg 60 
aataacaatg tgaaacgagc cgtcgaatat ttcgaagatg ataatcaaga tgatgcacta 120 
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acatccactt 
gatgttccgc 
tctgcgtccc 
ttatcacaga 
acctctggga 
gatgatttta 
gaggaatatc 
cagtttggag 
aacgaaaacc 
tcctcctctt 
gctcaaattc 
gacgccgact 
gggctctcac 
ccgtcgaatg 
gcatcggata 
gttccagttc 
agatatcgga 
gaagttgatg 
tccgccgatc 
aagcaagtgc 
tctctagtac 
cggagtatga 
catcagaaca 
gtcgaggaca 
cgaagtgacg 
gagcacgtgt 
catcaacggc 
aaggtgccat 
aaaacagacg 
agaatccaac 
tttggaaatc 
aagaagtata 
gggcattacg 
cgacacccag 
cacggagcat 
cgtggaatat 
tggccgcaga 
gctcacgatg 
ttcgtgcgat 
acgctccggc 
atcccgccgg 
gcctctgccg 
cgacccgccg 
gttgatgaag 
gacaacggga 
gccaacggtc 
gtgaagcttg 
caatatccct 
tcgattatct 



catccggcaa 
cgccaccagt 
ccgaacgtca 
gtacaaggct 
aaaacaacat 
ctgatacagc 
gccggcaacc 
ggggaactct 
agctccgcga 
ccgaagaaat 
cagaggagaa 
ttgacaattc 
cgcttgagga 
tatcgttggc 
ttagaagaga 
tacggaagga 
agtcaaggga 
agaaggagga 
caaaatctca 
ctgcaccagt 
catcaaggat 
gaggaaaacg 
ttcggctccg 
ccgaagtcgt 
tcatctcgcg 
tcttcgggct 
tcgaaaagtt 
acgtgttgca 
taacgatgaa 
ccaaaagaga 
gaccgccacc 
gctcatttga 
cgggcacccg 
attttggagc 
ctccgttcga 
cgatttatga 
cgcagaccct 
agcaaattga 
tcgccgcctc 
acgagaatac 
cgcctatacg 
atcagatctt 
aactgccacc 
gaattgtttg 
acgtgacgcc 
tgggtcttac 
tcgccgacaa 
tttcctcggg 
ctgaaaaatc 



cagtacacag 
ggctccagaa 
attccttgat 
cgatgagacg 
ttcctccgca 
tgccccggta 
acatcagaat 
gtccggtaga 
gaagaagctc 
tgtggaagcc 
gcccaaaccg 
cccaccactt 
ggctcagaga 
agaggaggcg 
agaggaggaa 
tctggaagtt 
aattgaggaa 
tattgatgcg 
cacgccgatt 
tgtcgttgct 
cagtggaagg 
gaagacccgt 
tgctccagct 
cgtcgagctc 
tgacgtgttc 
ctcgttcctt 
tgcaccgtcc 
cttacgattc 
cgagttgtat 
cgcagcattc 
ggtgatcacc 
agatcagtca 
gatttctgag 
gcatgtgcat 
tcctgatact 
gcagcaggga 
gcaattcgat 
gagcacattt 
ccaacatcgt 
gattcaggcg 
gcaaggtctc 
tgcgaaaatg 
accggcaccg 
tgattcccaa 
acgtggcatg 
cctagtcgac 
tggagccgga 
gtgtacacta 
gtaa 



44 

aaggaatcgt 
actccggctc 
gagagtcacc 
tttattgagg 
gcttcacctg 
gccccgcoac 
ccgtttgatg 
gacccatttg 
cacaaaaaag 
tcaattcacg 
aaggctatcc 
catcattact 
gctctccgag 
aagcttgctg 
gaagtggtgg 
gaggaggctc 
ccggtggttg 
atttttgaga 
ttgatggccg 
caggagtcgc 
cagtcgacgc 
gcgattcccg 
acgaaaaaga 
ctgaacggcc 
tctcttatcg 
cgtgatggcg 
ggatggaaaa 
aaattttatc 
ctacaatgta 
gaactcgccg 
gattatttcg 
cggctgaaga 
gccaagcaca 
cgtgtgttcc 
gggtcatcgc 
ggagcccgtg 
aagaagcgat 
tatacggatc 
tggatgatga 
atgccagacg 
gaagaatcac 
tcagtgagcc 
tcgtcgaaat 
gctgaaaact 
caatttgaca 
gggaatctca 
atgaaagcgg 
gagttgtgta 



ctccgttcac 
cagcccaaaa 
tcggcggaat 
aatactctat 
gtcccaagtc 
cagcacctac 
aggaagagga 
atgaagattc 
agcaactagc 
aggatgagcc 
cggcttttga 
cggcggttca 
ccaaccgagc 
ccagacagag 
aagaagatcc 
cgaaatcggt 
ttgatcggtt 
aatatcgaaa 
acgagtacaa 
cgatcttaaa 
ggcgaagtgt 
agttcttcga 
aacgcataag 
aaaaagttga 
tccagaacat 
agcactactt 
gtgtggcacg 
cgcagatttt 
gacgagacgt 
cgctcgccct 
atattcaaca 
acattctcgc 
aatacattca 
gaacgaagcc 
tttggattgg 
aggtgattgc 
ttgtgattgt 
atcactcgaa 
agatgcgtca 
tgattgttga 
ctccatcaac 
aggaaaaacc 
tcgcggctca 
ttgagcgcgt 
ttttgctcgt 
acggagttcc 
taagaattcg 
aatctacaca 



agattttgat 
ccgccgagaa 
tggatctccg 
tgagttggac 
tccatttgac 
aaaagctgcc 
agaagaatca 
tggaaactct 
tcaccggctc 
aattgtgatg 
taacgcctat 
tttggaaact 
aaggcataag 
atactcgaat 
agcagtggta 
acgaccgcct 
tgttgaggaa 
gacgtctgta 
agaaccacag 
acgaagaaac 
caccagtgtt 
cctaacccga 
cctacaccgc 
agtggcatgc 
gaatatcaat 
tatagaggat 
cgtgggcgtt 
ggatttcatc 
actcgaagag 
ccaagccgaa 
ctacctgccc 
cgagctgcac 
aatctgtcag 
aacatcagct 
aattatgcca 
cgagcatgtt 
tgccgttgga 
atcctcctat 
gtggaagagc 
agggcagact 
tccactgctc 
ggcggccaac 
atatgatacg 
cgactcgact 
gaaagacccc 
aggagtgtat 
aaatttcagt 
taacgttttt 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3024 



<210> 28 
<211> 1007 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 28 

Met Glu Arg His Gly Ser Glu Leu Gin Gly Glu 
15 10 



Val Val Phe Thr Thr 
15 
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Thr Ser Thr Ala Asn Asn Asn Val Lys Arg Ala Val Glu Tyr Phe Glu 
20 25 30 

Asp Asc Asn Gin Asp Asp Ala Leu Thr Ser Thr Ser Ser Gly Asn Ser 
35 40 45 

Thr Gin Lys Glu Ser Ser Pro Phe Thr Asp Phe Asp Asp Val Pro Pro 
50 55 " 60 

Pro Pro Val Ala Pro Glu Thr Pro Ala Pro Ala Gin Asn Arg Arg Glu 
65 70 75 80 

Ser Ala Ser Pro Glu Arg Gin Phe Leu Asp Glu Ser His Leu Gly Gly 
85 90 95 

lie Giv Ser Pro Leu Ser Gin Ser Thr Arg Leu Asp Glu Thr Phe lie 
100 105 110 

Glu Glu Tyr Ser lie Glu Leu Asp Thr Ser Gly Lys Asn Asn lie Ser 
115 120 . - 125 

Ser Ala Ala Ser Pro Gly Pro Lys Ser Pro Phe Asp Asp Asp Phe Thr 
130 135 " 140 

Asp Thr Ala Ala Pro Val Ala Pro Pro Pro Ala Pro Thr Lys Ala Ala 
145 150 155 160 

Glu Glu Tyr Arg Arg Gin Pro His Gin Asn Pro Phe Asp Glu Glu Glu 
165 170 175 

Glu Glu Glu Ser Gin Phe Gly Gly Gly Thr Leu Ser Gly Arg Asp Pro 
180 185 190 

Phe Aso Glu Asp Ser Gly Asn Ser Asn Glu Asn Gin Leu Arg Glu Lys 
195 200 205 

Lys Leu His Lys Lys Glu Gin Leu Ala His Arg Leu Ser Ser Ser Ser 
210 215 220 

Glu Glu He Val Glu Ala Ser He His Glu Asp Glu Pro He Val Met 
225 230 235 240 

Ala Gin He Pro Glu Glu Lys Pro Lys Pro Lys Ala lie Pro Ala Phe 
245 " 250 255 

Asp Asn Ala Tyr Asp Ala Asp Phe Asp Asn Ser Pro Pro Leu His His 
260 265 270 

Tyr Ser Ala Val His Leu Glu Thr Gly Leu Ser Pro Leu Glu Glu Ala 
275 280 285 

Gin Arg Ala Leu Arg Ala Asn Arg Ala Arg His Lys Pro Ser Asn Val 
290 295 300 



Ser Leu Ala Glu Glu Ala Lys Leu Ala Ala Arg Gin Arg Tyr Ser Asn 
305 310 315 320 



BNSDOCID: <WO 0073328A2J_> 



WO 00/73328 PCT/EP00/05108 

Ala Ser Asp He Arg Arg Glu Glu Glu Glu Gl« V.l Val Gl« Glu Asp 

pro Ala Val Val Val Pro Val Leu Arg Lys Asp Leu Glu Val Glu Glu 
340 345 J5U 

Ala Pro Lys Ser Val Arg Pro Pro Arg Tyr Arg Lys Ser Arg Glu He 



355 



360 



Glu Glu Pro Val Val Val Asp Arg Phe Val Glu Glu Glu Val Asc Glu 
370 375 38U 

Lys Glu Asp lie Asp Ala He Phe Glu Lys Tyr Arg Lys Thr Ser Val 
385 390 395 

Ser Ala Asp Pro Lys Ser His Thr Pro lie Leu Met Ala Asp Glu Tyr 
405 41U 

Lys Glu Pro Gin Lys Gin Val Pro Ala Pro Val Val Val Ala Gin Glu 
- - - 425 



420 



ser Pro He Leu Lys Arg Arg Asn Ser Leu Val Pro Ser Arg He Ser 

435 440 445 

Gly Arg Gin Ser Thr Arg Arg Ser Val Thr Ser Val Arg Ser Met Arg 

450 455 
Gly Lys Arg Lys Thr Arg Ala lie Pro Glu Phe Phe Asp Leu Thr Arg 
465 4 ™ 4/5 

His Gin Asn He Arg Leu Arg Ala Pro Ala Thr Lys Lys Lys Arg He 
485 4yu 

Ser Leu His Arg Val Glu Asp Thr Glu Val Val Val Glu Leu Leu Asn 
500 505 



Gly Gin Lys Val Glu Val Ala Cys Arg Ser Asp Val He Ser Arg Asp 

515 520 5 

Val Phe Ser Leu He Val Gin Asn Met Asn He Asn Glu His Val Phe 
530 "5 540 

Phe Gly Leu Ser Phe Leu Arg Asp Gly Glu His Tyr Phe He Glu Asp 
545 550 555 

His Gin Arg Leu Glu Lys Phe Ala Pro Ser Gly Trp Lys Ser Val Ala 



565 



Arg Val Gly Val Lys Val Pro Tyr Val Leu His Leu Arg Phe Lys Phe 



580 



Tyr Pro Gin He Leu Asp Phe He Lys Thr Asp Val Thr Met Asn Glu 
595 600 6Ub 



Leu Tyr Leu Gin Cys Arg Arg Asp Val Leu Glu Glu Arg He Gin Pro 

610 615 
Lys Arg Asp Ala Ala Phe Glu Leu Ala Ala Leu Ala Leu Gin Ala Glu 
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625 630 635 640 

Phe Glv Asn Arg Pro Pro Pro Val lie Thr Asp Tyr Phe Asp lie Gin 
645 650 655 

His Tyr Leu Pro Lys Lys Tyr Ser Ser Phe Glu Asp Gin Ser Arg Leu 
660 665 670 

Lys Asn lie Leu Ala Glu Leu His Gly His Tyr Ala Gly Thr Arg lie 
675 680 " 685 

Ser Glu Ala Lys His Lys Tyr lie Gin lie Cys Gin Arg His Pro Asp 
690 695 700 

Phe Gly Ala His Val His Arg Val Phe Arg Thr Lys Pro Thr Ser Ala 
705 710 715 720 

His Gly Ala Ser Pro Phe Asp Pro Asp Thr Gly Ser Ser Leu Trp lie 
725 730 735 

Gly He Met Pro Arg Gly He Ser He Tyr Glu Gin Gin Gly Gly Ala 
740 745 750 

Arg Glu Val He Ala Glu His Val Trp Pro Gin Thr Gin Thr Leu Gin 
755 760 765 

Phe Asp Lys Lys Arg Phe Val He Val Ala Val Gly Ala His Asp Glu 
770 775 780 

Gin He Glu Ser Thr Phe Tyr Thr Asp His His Ser Lys Ser Ser Tyr 
785 790 795 800 

Phe Val Arg Phe Ala Ala Ser Gin His Arg Trp Met Met Lys Met Arg 
805 810 815 

Gin Trp Lys Ser Thr Leu Arg His Glu Asn Thr He Gin Ala Met Pro 
820 825 830 

Asp Val lie Val Glu Gly Gin Thr lie Pro Pro Ala Pro He Arg Gin 
835 840 845 

Gly Leu Glu Glu Ser Pro Pro Ser Thr Pro Leu Leu Ala Ser Ala Asp 
850 855 860 

Gin He Phe Ala Lys Met Ser Val Ser Gin Glu Lys Pro Ala Ala Asn 
865 870 875 880 

Arg Pro Ala Glu Leu Pro Pro Pro Ala Pro Ser Ser Lys Phe Ala Ala 
885 890 895 

Gin Tyr Asp Thr Val Asp Glu Gly He Val Cys Asp Ser Gin Ala Glu 
900 905 910 

Asn Phe Glu Arg Val Asp Ser Thr Asp Asn Gly Asn Val Thr Pro Arg 
915 920 925 

Gly Met Gin Phe Asp He Leu Leu Val Lys Asp Pro Ala Asn Gly Leu 
930 935 940 
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Gly Leu Thr Leu Val Asp Gly Asn Leu Asn Gly Val Pro Gly Val Tyr 
945 950 955 960 

Val Lys Leu Val Ala Asp Asn Gly Ala Gly Met Lvs Ala Val Arg lie 
965 970 975 

Arg Asn Phe Ser Gin Tyr Pro Phe Ser Ser Gly Cys Thr Leu Glu Leu 
980 985 990 

Cys Lys Ser Thr His Asn Val Phe Ser lie lie Ser Glu Lys Ser 
995 1000 1005 



<210> 29 
<211> 1311 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 29 

aaaccgaagg 

ccacttcatc 

cagagagctc 

gaggcgaagc 

gaggaagaag 

gaagttgagg 

gaggaaccgg 

gatgcgattt 

ccgattttga 

gttgctcagg 

ggaaggcagt 

acccgtgcga 

ccagctacga 

gagctcctga 

gtgttctctc 

ttccttcgtg 

ccgtccggat 

cgattcaaat 

ttgtatctac 

gcattcgaac 

atcaccgatt 

cagtcacggc 



ctatcccggc 
attactcggc 
tccgagccaa 
ttgctgccag 
tggtggaaga 
aggctccgaa 
tggttgttga 
ttgagaaata 
tggccgacga 
agtcgccgat 
cgacgcggcg 
ttcccgagtt 
aaaagaaacg 
acggccaaaa 
ttatcgtcca 
atggcgagca 
ggaaaagtgt 
tttatccgca 
aatgtagacg 
tcgccgcgct 
atttcgatat 
tgaagaacat 



ttttgataac 
ggttcatttg 
ccgagcaagg 
acagagatac 
agatccagca 
atcggtacga 
tcggtttgtt 
tcgaaagacg 
gtacaaagaa 
cttaaaacga 
aagtgtcacc 
cttcgaccta 
cataagccta 
agttgaagtg 
gaacatgaat 
ctactttata 
ggcacgcgtg 
gattttggat 
agacgtactc 
cgccctccaa 
tcaacactac 
tctcgccgag 



gcctatgacg 
gaaactgggc 
cataagccgt 
tcgaatgcat 
gtggtagttc 
ccgcctagat 
gaggaagaag 
tctgtatccg 
ccacagaagc 
agaaactctc 
agtgttcgga 
acccgacarc 
caccgcgtcg 
gcatgccgaa 
atcaatgagc 
gaggatcatc 
ggcgttaagg 
ttcatcaaaa 
gaagagagaa 
gccgaatttg 
ctgcccaaga 
ctgcacgggc 



ccgactttga 
tctcaccgct 
cgaatgtatc 
cggatattag 
cagttctacg 
atcggaagtc 
ttgatgagaa 
ccgatccaaa 
aagtgcctgc 
tagtaccatc 
gtatgagagg 
agaacattcg 
aggacaccga 
gtgacgtcat 
acgtgttctt 
aacggctcga 
tgccatacgt 
cagacgtaac 
tccaacccaa 
gaaatcgacc 
agtatagctc 
attacgcggg 



caattcccca 
tgaggaggct 
gttggcagag 
aagagaagag 
gaaggatctg 
aagggaaatt 
ggaggatatt 
atctcacacg 
accagttgtc 
aaggatcagt 
aaaacggaag 
gctccgtgct 
agtcgtcgtc 
ctcgcgtgac 
cgggctctcg 
aaagtttgca 
gttgcactta 
gatgaacgag 
aagagacgca 
gccaccggtg 
atttgaagat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1311 



<210> 30 
<211> 437 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 30 

Lys Pro Lys Ala lie Pro Ala Phe Asp Asn Ala Tyr Asp Ala Asp Phe 
15 10 15 

Asp Asn Ser Pro Pro Leu His His Tyr Ser Ala Val His Leu Glu Thr 
20 25 30 



Gly Leu Ser Pro Leu Glu Glu Ala Gin Arg Ala Leu Arg Ala Asn Arg 
35 40 45 
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Ala Arg His Lys Pro Ser Asn Val Ser Leu Ala Glu Glu Ala Lys Leu 
50 55 60 

Ala Ala Arg Gin Arg Tyr Ser Asn Ala Ser Asp He Arg Arg Glu Glu 
65 70 75 80 

-. ^ -. ,~ I _ rr-T "> y-T., 7\ — ^ T> ~ ^» ^ 1 -5 Wo 1 Us 1 Wj=> 1 D>~r\ V.3 1 1.011 
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85 90 95 

Arg Lys Asp Leu Glu Val Glu Glu Ala Pro Lys Ser Val Arg Pro Pro 
100 105 HO 

Arg Tyr Arg Lys Ser Arg Glu He Glu Glu Pro Val Val Val Asp Arg 
115 120 125 

Phe Val Glu Glu Glu Val Asp Glu Lys Glu Asp He Asp Ala He Phe 
130 135 140 

Glu Lys Tyr Arg Lys Thr Ser Val Ser Ala Asp Pro Lys Ser His Thr 
145 ~ 150 155 160 

Pro He Leu Met Ala Asp Glu Tyr Lys Glu Pro Gin Lys Gin Val Pro 
165 170 175 

Ala Pro Val Val Val Ala Gin Glu Ser Pro He Leu Lys Arg Arg Asn 
180 185 190 

Ser Leu Val Pro Ser Arg lie Ser Gly Arg Gin Ser Thr Arg Arg Ser 
195 200 205 

Val Thr Ser Val Arg Ser Met Arg Gly Lys Arg Lys Thr Arg Ala lie 
210 215 220 

Pro Glu Phe Phe Asp Leu Thr Arg His Gin Asn He Arg Leu Arg Ala 
225 230 235 240 

Pro Ala Thr Lys Lys Lys Arg lie Ser Leu His Arg Val Glu Asp Thr 
245 250 255 

Glu Val Val Val Glu Leu Leu Asn Gly Gin Lys Val Glu Val Ala Cys 
260 265 270 

Arg Ser Asp Val lie Ser Arg Asp Val Phe Ser Leu lie Val Gin Asn 
275 280 285 

Met Asn lie Asn Glu His Val Phe Phe Gly Leu Ser Phe Leu Arg Asp 
290 295 300 

Gly Glu His Tyr Phe lie Glu Asp His Gin Arg Leu Glu Lys Phe Ala 
305 310 315 320 

Pro Ser Gly Trp Lys Ser Val Ala Arg Val Gly Val Lys Val Pro Tyr 
325 330 335 

Val Leu His Leu Arg Phe Lys Phe Tyr Pro Gin lie Leu Asp Phe lie 
340 345 350 

Lys Thr Asp Val Thr Met Asn Glu Leu Tyr Leu Gin Cys Arg Arg Asp 
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355 



360 



365 



Val Leu Glu Glu Arg lie Gin Pro Lys Arg Asp Ala Ala Phe Glu Leu 
370 375 380 

Ala Ala Leu Ala Leu Gin Ala Glu Phe Gly Asn Arg Pro Pro Pro Val 
385 390 395 400 

He Thr Asp Tyr Phe Asp He Gin His Tyr Leu Pro Lys Lys Tyr Ser 
405 410 415 

Ser Phe Glu Asp Gin Ser Arg Leu Lys Asn He Leu Ala Glu Leu His 
420 425 430 



Gly His Tyr Ala Gly 
435 



<210> 31 

<211> 2574 

<212> DNA 

<213> Caenorhabditis elegans 



<400> 31 

atgcctgcta 

gttcgaaagg 

cgtaaagtca 

gaagaaatca 

aactgttcga 

ggccgtgatg 

gaaatggtcc 

gctactaata 

tcagaggaaa 

ggtccagcaa 

gatgagatag 

cttgccaata 

agtgaactat 

cttagcgaaa 

caaaaaagtg 

aaaagttcag 

caaacatccg 

aaagattcaa 

atatcrcgag 

gtggcaagcc 

cagtcgcgtg 

cgcaatcatt 

gctgaagtcg 

attatcgtcg 

atggtccgcc 

cttcgggatc 

aaacaatctg 

attttaaaag 

atattgcggg 

ggatcagctg 

atagttggac 

caatatttcg 

aaacactttc 

tctgagcagg 



atgagctttt 
tgcaaccaaa 
aaatcgagct 
gacggcgtct 
ctttggacga 
gaatacaccg 
agcagcagtc 
acctcagcac 
tcgctaaatc 
aaacaatttc 
gagtttcaca 
agttcaaaca 
acattcagct 
aaggtgacat 
aacaaaatcc 
cttcagcgtt 
attcctttga 
aaattttcaa 
atcgaaccgc 
aacaatgtcc 
ctcaagaaga 
cagtgcttcg 
taaaaactca 
acttaggaaa 
tttgtgatcc 
ctgtcgagtc 
ttaaagtggt 
atgaagaacg 
ccgtgatggt 
cagtctacgt 
aagctaaaca 
tggatgtaca 
cctcgaatgt 
atgtgatgtt 



cggagattct 
gtcgatcaat 
tcaagctgtt 
gctggattct 
tcttcttcaa 
atatgggcca 
gagctcaaaa 
tcatgggtca 
gagaggaaca 
tcgcgtaaaa 
tcctgatgag 
gttgtattgt 
caatgttcct 
cttcgttgat 
gtcagcagat 
tgaacaatct 
cgggttcaac 
ctcgaatcaa 
tgatatgacc 
ggtatcttcg 
cgacaacaac 
tcatcgtacg 
tggtagcatt 
gaacattttc 
ccacgttgac 
tgatttacgt 
caagcctgca 
tgaagaaatg 
agcgcctatt 
ttatcgacaa 
agatggtgaa 
tctatctcat 
tgtattcgca 
cgaaaacatt 



gatccggaag 
gtaactgaga 
ttggttgcgg 
tatggtgcat 
gcatgctcgg 
agaactactg 
cgcccggccc 
tcattccggg 
cctgagcaat 
aaccttgcag 
ccaaatcgca 
ttaccagcat 
cctttcaacg 
tgtattgatc 
gtttctattc 
gtagtatccg 
agtttcgaag 
gaaagcatcg 
gacattgcat 
tcactcgttc 
acatctattc 
atcaagcctc 
ccttatgaag 
aaagtttatg 
acgactacat 
tggatgacac 
atgtttattg 
gacgttgaaa 
gtggagctcc 
ggagcggagg 
gtgttggttt 
ttgttcccga 
acattgcatg 
gataatgatg 



gggatgaaat 
atcaaaaggt 
aaaaaatacc 
gtcctgatcc 
agtcaattgt 
aagccaacca 
gctcgttttt 
cattcagagg 
tcaaagcaag 
aggttttgaa 
agattgtaac 
ggggaaagaa 
aatatctgca 
gtgacaatgc 
aatctgaatc 
ctccttcaac 
tgcccccaga 
atgactatcc 
tgcgctttgg 
cacaaaatgg 
taactattca 
gcaatccaac 
cgctttcgga 
caactcaacc 
tgccactcct 
tgggaaattc 
cgccacgcgg 
aggttgctac 
agcgtgaaac 
tgcgctatta 
tgttggctga 
ttccggaaga 
gagtactggg 
atacaaaacg 



ttttgaaaga 
agaccctatg 
aattagtaca 
gaaaagatat 
acacactttc 
agatattatc 
aggttcagga 
tccgtatgcg 
acacaagttg 
agaatatgct 
actggcagct 
catatcggaa 
tttctggcgt 
cgatccaact 
ttttggcggt 
tattagagat 
aaatggaagc 
aggaaatgct 
aactgtctct 
aattcttcgt 
atcatctcgt 
acaaaatctt 
ttgtgataag 
tggagaaatg 
agagaacaat 
ccatatcaag 
atttttgttg 
tatggggaat 
cgtccgcacg 
tagagtgttg 
tgtcgatgat 
agccagcttc 
tttgacgctt 
atttgtgggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 
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ggatattttc atggaaacga tgacagaatt cttaacatcg atatggtttg gaaaaatgag 2100 
cgtggtcaat ttgagtggct ttcacaaatt gtcaagagac gtggagcagt tacttcgtcc 2160 
gatgccaaca tcattcactt tccacattct gcactcgatg tgataaagtc tgttggacca 2220 
gattgttcgg tgtgctttgt cgactattca gttcgtgatg aatctgcaac atcttcattg 2280 
atggaatcga ctagaattgt tcatgatagt cgtgaatcta tgacaactac ttatgttggt 2340 
gagat.gccaa gcccaattat cgaagaaatc gatgcaacat cttcatttga cccaaaactc 2400 

, , _ _i .a_ j -A— j j- ^ a — . a_ j _ „ -» ^, rri-far t- a t- rr .a f- f rr 1" rr 9d£0 

LLyaatLLL-L OLU^aL <_ v_ \^ya(.aay «_ <~c* <-* ^^v^^^^^ — — — — ~ z> 3 => ~ =» 

gggatgttcc aattcgttcg aagtttgaag gatttattcg gcgataacaa tgaatgggaa 2520 
cgcttgctca cgtacatgct cacgactggt aaaaacaata acattcgtct ctga 2574 

<210> 32 
<211> 857 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 32 

Met Pro Ala Asn Glu Leu Phe Gly Asp Ser Asp Pro Glu Gly Asp Glu 
15 10 15 

lie Phe Glu Arg Val Arg Lys Val Gin Pro Lys Ser lie Asn Val Thr 
20 25 30 

Glu Asn Gin Lys Val Asp Pro Met Arg Lys Val Lys lie Glu Leu Gin 
35 40 45 

Ala Val Leu Val Ala Glu Lys lie Pro He Ser Thr Glu Glu He Arg 
50 55 60 

Arg Arg Leu Leu Asp Ser Tyr Gly Ala Cys Pro Asp Pro Lys Arg Tyr 
65 " 70 75 80 

Asn Cys Ser Thr Leu Asp Asp Leu Leu Gin Ala Cys Ser Glu Ser He 

85 90 95 

Val His Thr Phe Gly Arg Asp Gly He His Arg Tyr Gly Pro Arg Thr 
100 105 110 

Thr Glu Ala Asn Gin Asp He He Glu Met Val Gin Gin Gin Ser Ser 
115 120 125 

Ser Lys Arg Pro Ala Arg Ser Phe Leu Gly Ser Gly Ala Thr Asn Asn 
130 " 135 140 

Leu Ser Thr His Gly Ser Ser Phe Arg Ala Phe Arg Gly Pro Tyr Ala 
145 150 155 160 

Ser Glu Glu He Ala Lys Ser Arg Gly Thr Pro Glu Gin Phe Lys Ala 
165 170 175 

Arg His Lys Leu Gly Pro Ala Lys Thr He Ser Arg Val Lys Asn Leu 
180 185 190 

Ala Glu Val Leu Lys Glu Tyr Ala Asp Glu He Gly Val Ser His Pro 
195 200 205 

Asp Glu Pro Asn Arg Lys lie Val Thr Leu Ala Ala Leu Ala Asn Lys 
210 215 220 



BNSDOaD: <WO 007332SA2_I_> 



WO 00/73328 



PCT7EP00/05108 



Phe Lys Gin Leu 
225 

Ser Glu Leu Tyr 



His Phe Trp Arg 
260 



Asp Arg Asp Asn 
27 5 

Ala Asp Val Ser 
290 

Ser Ala Phe Glu 
305 

Gin Thr Ser Asp 



Glu Asn Gly Ser 
340 

lie Asp Asp Tyr 
355 



Met Thr Asp He 
370 

Gin Cys Pro Val 
385 

Gin Ser Arg Ala 



Gin Ser Ser Arg 
420 

Pro Arg Asn Pro 
435 

Ser He Pro Tyr 
450 

Leu Gly Lys Asn 
465 

Met Val Arg Leu 



Leu Glu Asn Asn 
500 

Thr Leu Gly Asn 
515 

Pro Ala Met Phe 



Tyr Cys 
230 

He Gin 
245 

Leu Ser 

Ala Asp 

He Gin 

Gin Ser 
310 

Ser Phe 
325 

Lys Asp 
Pro Gly 



Ala Leu 

Ser Ser 
390 

Gin Glu 
405 

Arg Asn 

Thr Gin 

Glu Ala 

He Phe 
470 

Cys Asp 
485 

Leu Arg 
Ser His 
He Ala 



Leu Pro 

Leu Asn 

Glu Lys 

Pro Thr 
280 

Ser Glu 
295 

Val Val 

Asp Gly 

Ser Lys 

Asn Ala 
360 

Arg Phe 
375 

Ser Leu 

Asp Asp 

His Ser 

Asn Leu 
440 

Leu Ser 
455 

Lys Val 

Pro His 

Asp Pro 

lie Lys 
520 

Pro Arg 



52 

Ala Trp 

Val Pro 
250 

Gly Asp 
265 

Gin Lys 

Ser Phe 

Ser Ala 

Phe Asn 
330 

He Phe 
345 

He Ser 

Gly Thr 

Val Pro 

Asn Asn 
410 

Val Leu 
425 

Ala Glu 



Asp Cys 



Tyr Ala 

Val Asp 
490 

Val Glu 
505 

Lys Gin 
Gly Phe 



Gly Lys 
235 

Pro Phe 

lie Phe 

Ser Glu 

Gly Gly 
300 

Pro Ser 
315 

Ser Phe 

Asn Ser 

Arg Asp 

Val Ser 
380 

Gin Asn 
395 

Thr Ser 

Arg His 

Val Val 

Asp Lys 
460 

Thr Gin 
475 

Thr Thr 
Ser Asp 
Ser Val 
Leu Leu 



Asn He 



Asn Glu 

Val Asp 
270 

Gin Asn 
285 

Lys Ser 

Thr He 

Glu Val 

Asn Gin 
350 

Arg Thr 
365 

Val Ala 

Gly lie 

lie Leu 

Arg Thr 
430 

Lys Thr 
445 

He lie 

Pro Gly 

Thr Leu 

Leu Arg 
510 

Lys Val 
525 

lie Leu 



Ser Glu 
240 

Tyr Leu 
255 

Cys lie 

Pro Ser 

Ser Ala 

Arg Asp 
320 

Pro Pro 
335 

Glu Ser 



Ala Asp 



Ser Gin 

Leu Arg 
400 

Thr lie 
415 

lie Lys 

His Gly 

Val Asp 

Glu Met 
480 

Pro Leu 
495 

Trp Met 
Val Lys 
Lys Asp 



BNSDOCID: <WO___0073328A2_L> 



WO 00/73328 



PCT/EP00/05108 



53 



530 



535 



540 



Glu Glu Arg Glu Glu Met Asp Val Glu Lys Val Ala Thr Met Gly Asn 
545 550 555 560 

He Leu Arg Ala Val Met Val Ala Pro He Val Glu Leu Gin Arg Glu 
565 570 575 

Thr Val Arg Thr Gly Ser Ala Ala Val Tyr Val Tyr Arg Gin Gly Ala 
580 585 590 

Glu Val Arg Tyr Tyr Arg Val Leu He Val Gly Gin Ala Lys Gin Asp 
595 600 605 

Gly Glu Val Leu Val Leu Leu Ala Asp Val Asp Asp Gin Tyr Phe Val 
610 615 620 

Asp Val His Leu Ser His Leu Phe Pro He Pro Glu Glu Ala Ser Phe 
625 630 635 640 

Lys His Phe Pro Ser Asn Val Val Phe Ala Thr Leu His Gly Val Leu 
645 650 655 

Gly Leu Thr Leu Ser Glu Gin Asp Val Met Phe Glu Asn He Asp Asn 
660 665 670 

Asp Asp Thr Lys Arg Phe Val Gly Gly Tyr Phe His Gly Asn Asp Asp 
675 " 680 685 

Arg He Leu Asn He Asp Met Val Trp Lys Asn Glu Arg Gly Gin Phe 
690 695 700 

Glu Trp Leu Ser Gin He Val Lys Arg Arg Gly Ala Val Thr Ser Ser 
705 710 715 720 

Asp Ala Asn He lie His Phe Pro His Ser Ala Leu Asp Val He Lys 
725 730 735 

Ser Val Gly Pro Asp Cys Ser Val Cys Phe Val Asp Tyr Ser Val Arg 
740 745 750 

Asp Glu Ser Ala Thr Ser Ser Leu Met Glu Ser Thr Arg lie Val His 
755 760 765 

Asp Ser Arg Glu Ser Met Thr Thr Thr Tyr Val Gly Glu Met Pro Ser 
770 775 780 

Pro lie He Glu Glu He Asp Ala Thr Ser Ser Phe Asp Pro Lys Leu 
785 790 795 800 

Leu Asn Leu His Ser Leu Phe Asp Lys Leu He Glu Glu Gin Asn Val 
805 810 815 

Thr Met He Val Gly Met Phe Gin Phe Val Arg Ser Leu Lys Asp Leu 
820 825 830 



Phe Giv Asp Asn Asn Glu Trp Glu Arg Leu Leu Thr Tyr Met Leu Thr 
835 840 845 
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Thr Gly Lys Asn Asn Asn lie Arg Leu 
850 855 



<210> 33 
<211> 1587 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 33 

atttttgaaa 

gtagacccta 

ccaattagta 

ccgaaaagat 

gtacacactt 

caagatatta 

ttaggttcag 

ggtccgtatg 

agacacaagt 

aaagaatatg 

acactggcag 

aacatatcgg 

catttctggc 

gccgatccaa 

tcttttggcg 

actattagag 

gaaaatggaa 

ccaggaaatg 

ggaactgtct 

ggaattcttc 

caatcatctc 

acacaaaatc 

gattgtgata 

cctggagaaa 

ctagagaaca 

tcccatatca 

ggatttttgt 



gagttcgaaa 
tgcgtaaagt 
cagaagaaat 
ataactgttc 
tcggccgtga 
tcgaaatggt 
gagctactaa 
cgtcagagga 
tgggtccagc 
ctgatgagat 
ctcttgccaa 
aaagtgaact 
gtcttagcga 
ctcaaaaaag 
gtaaaagttc 
atcaaacatc 
gcaaagattc 
ctatatctcg 
ctgtggcaag 
gtcagtcgcg 
gtcgcaatca 
ttgctgaagt 
agattatcgt 
tgatggtccg 
atcttcggga 
agaaacaatc 
tgattttaaa 



ggtgcaacca 
caaaatcgag 
cagacggcgt 
gactttggac 
tggaatacac 
ccagcagcag 
taacctcagc 
aatcgctaaa 
aaaaacaatt 
aggagtttca 
taagttcaaa 
atacattcag 
aaaaggtgac 
tgaacaaaat 
agcttcagcg 
cgattccttt 
aaaaattttc 
agatcgaacc 
ccaacaatgt 
tgctcaagaa 
ttcagtgctt 
cgtaaaaact 
cgacttagga 
cctttgtgat 
tcctgtcgag 
tgttaaagtg 
agatgaa 



aagtcgatca 
cttcaagctg 
ctgctggatt 
gatcttcttc 
cgatatgggc 
tcgagctcaa 
actcatgggt 
tcgagaggaa 
tctcgcgtaa 
catcctgatg 
cagttgtatt 
ctcaatgttc 
atcttcgttg 
ccgtcagcag 
tttgaacaat 
gacgggttca 
aactcgaatc 
gctgatatga 
ccggtatctt 
gacgacaaca 
cgtcatcgta 
catggtagca 
aagaacattt 
ccccacgttg 
tctgatttac 
gtcaagcctg 



atgtaactga 
ttttggttgc 
cttatggtgc 
aagcatgctc 
caagaactac 
aacgcccggc 
catcattccg 
cacctgagca 
aaaaccttgc 
agccaaatcg 
gtttaccagc 
ctcctttcaa 
attgtattga 
atgtttctat 
ctgtagtatc 
acagtttcga 
aagaaagcat 
ccgacattgc 
cgtcactcgt 
acacatctat 
cgatcaagcc 
ttccttatga 
tcaaagttta 
acacgactac 
gttggatgac 
caatgtttat 



gaatcaaaag 
ggaaaaaata 
atgtcctgat 
ggagtcaatt 
tgaagccaac 
ccgctcgttt 
ggcattcaga 
attcaaagca 
agaggttttg 
caagattgta 
atggggaaag 
cgaatatctg 
tcgtgacaat 
tcaatctgaa 
cgctccttca 
agtgccccca 
cgatgactat 
attgcgcttt 
tccacaaaat 
tctaactatt 
tcgcaatcca 
agcgctttcg 
tgcaactcaa 
attgccactc 
actgggaaat 
tgcgccacgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1587 



<210> 34 
<211> 529 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 34 

lie Phe Glu Arg Val Arg Lys Val Gin Pro Lys Ser He Asn Val Thr 
15 10 15 

Glu Asn Gin Lys Val Asp Pro Met Arg Lys Val Lys He Glu Leu Gin 
20 25 30 

Ala Val Leu Val Ala Glu Lys He Pro He Ser Thr Glu Glu He Arg 
35 40 45 

Arg Arg Leu Leu Asp Ser Tyr Gly Ala Cys Pro Asp Pro Lys Arg Tyr 
50 55 60 



Asn Cys Ser Thr Leu Asp Asp Leu Leu Gin Ala Cys Ser Glu Ser lie 
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65 70 75 80 

Val His Thr Phe Gly Arg Asp Gly lie His Arg Tyr Gly Pro Arg Thr 
85 90 95 

Thr Glu Ala Asn Gin Asp lie lie Glu Met Val Gin Gin Gin Ser Ser 
100 105 110 

Ser Lys Arg Pro Ala Arg Ser Phe Leu Gly Ser Gly Ala Thr Asn Asn 
115 120 125 

Leu Ser Thr His Gly Ser Ser Phe Arg Ala Phe Arg Gly Pro Tyr Ala 
130 135 140 

Ser Glu Glu lie Ala Lys Ser Arg Gly Thr Pro Glu Gin Phe Lys Ala 
145 150 " 155 160 

Arg His Lys Leu Gly Pro Ala Lys Thr lie Ser Arg Val Lys Asn Leu 
165 170 175 

Ala Glu Val Leu Lys Glu Tyr Ala Asp Glu He Gly Val Ser His Pro 
180 185 190 

Asp Glu Pro Asn Arg Lys He Val Thr Leu Ala Ala Leu Ala Asn Lys 
195 200 205 

Phe Lys Gin Leu Tyr Cys Leu Pro Ala Trp Gly Lys Asn He Ser Glu 
210 215 220 

Ser Glu Leu Tyr He Gin Leu Asn Val Pro Pro Phe Asn Glu Tyr Leu 
225 230 235 240 

His Phe Trp Arg Leu Ser Glu Lys Gly Asp He Phe Val Asp Cys He 
245 250 255 

Asp Arg Asp Asn Ala Asp Pro Thr Gin Lys Ser Glu Gin Asn Pro Ser 
260 265 270 

Ala Asp Val Ser He Gin Ser Glu Ser Phe Gly Gly Lys Ser Ser Ala 
275 280 285 

Ser Ala Phe Glu Gin Ser Val Val Ser Ala Pro Ser Thr He Arg Asp 
290 295 300 

Gin Thr Ser Asp Ser Phe Asp Gly Phe Asn Ser Phe Glu Val Pro Pro 
305 310 315 320 

Glu Asn Gly Ser Lys Asp Ser Lys He Phe Asn Ser Asn Gin Glu Ser 
325 330 335 

lie Asp Asp Tyr Pro Gly Asn Ala He Ser Arg Asp Arg Thr Ala Asp 
340 345 350 

Met Thr Asp lie Ala Leu Arg Phe Gly Thr Val Ser Val Ala Ser Gin 
355 360 365 

Gin Cys Pro Val Ser Ser Ser Leu Val Pro Gin Asn Gly lie Leu Arg 
370 375 380 
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Gin Ser Arg Ala 
385 

Gin Ser Ser Arg 



Pro Arg Asn Pro 
420 



Gin Glu 
390 

Arg Asn 
405 



Asp Asp 



His Ser 



56 
Asn Asn 



Val Leu 
410 



Thr Gin Ash Leu 



Thr Ser lie Leu Thr lie 
395 400 

Arg His Arg Thr lie Lys 
415 



Ala Glu Val Val Lys Thr His Gly 
425 430 



Ser lie Pro Tyr Glu Ala 
435 



Leu Ser 
440 



Leu Gly Lys Asn 
450 

Met Val Arg Leu 
465 



lie Phe 



Cys Asp 
470 



Asp Cys 
Tyr Ala 
Pro His Val Asp 



Lys Val 
455 



Asp Lys lie lie Val Asp 
445 



Thr Gin 
460 

Thr Thr 
475 



Pro Gly Glu Met 



Thr Leu Pro Leu 
480 



Leu Glu Asn Asn 



Thr Leu Gly Asn 
500 



Leu Arg Asp Pro 
485 

Ser His lie Lys 



Val Glu Ser Asp Leu Arg Trp Met 
490 495 

Lys Gin Ser Val Lys Val Val Lys 
505 510 



Pro Ala Met Phe 
515 



He Ala 



Pro Arg 
520 



Gly Phe Leu Leu 



He Leu Lys Asp 
525 



Glu 



<210> 35 
<211> 1593 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 35 

atgcgcattg 

tgttgtctgt 

tttcgaaatt 

cgcggtctac 

gacggaggag 

ccaaaagatg 

ccgtactcta 

aagcgattag 

ttatgggaga 

ggacaaatga 

aagaagaaga 

ttggcagcta 

gcctcaagta 

cgttcggctt 

gccggtgaaa 

tgcgtatttc 

agcaatcaca 

gcagcgacaa 

ctcgatcagc 

caacatggag 



tccgtacaca 
tgatcaactc 
ctcactcttt 
ggagagcaga 
taccagaagt 
agatcgagct 
gtagaccact 
cctcactggg 
ggaagggtca 
ccagagacaa 
aggccaagaa 
tcgaggagaa 
cacggcatca 
cttccatttt 
caaatggtca 
atcacgatta 
gcgacgtaat 
taatgagaaa 
aaactccaag 
acttccacgt 



ccgcgacgag 
aattgagaaa 
cgccatcgaa 
tttagtcaaa 
aataggatgc 
tggtgaagga 
tcatcacata 
aagtgcacga 
gtccctgatc 
gcaggcttta 
agcaaaacaa 
aaagcgagga 
cacattctcg 
aacacacact 
gtacgacaac 
cgacgctgac 
agttccagct 
cgggacaccg 
gttttcattg 
ggaaactaga 



tttttacgga 
tcaaaacaaa 
aagttcaaaa 
cgacatcgcc 
ccagttttag 
gtcaagatga 
tgctatcaac 
ggatggacag 
ggaaagttct 
tatgagaaag 
ctgccccagc 
gacgctgatg 
acgacgacac 
attggaagaa 
aatcaggagc 
gatcaaatag 
ccacttccac 
aaggttacaa 
tcttcttcga 
atttccgagc 



cattgtgctt 
ttcaatcttc 
gaaaacaaca 
actcaacgac 
atcctattat 
cgtgcacttg 
tgctcgagga 
tgccacaacg 
gccgatgtcg 
agaaggctgt 
tacaatttaa 
tattccactc 
gatcgcgact 
cgtggtccga 
cacatccatc 
atacggattt 
cacttcaggc 
attattcacc 
gtggaggaga 
atctcaacgc 



gaacttgttt 
agcgtatttt 
aaaaatgcct 
aggagacaaa 
ctgccaatgt 
ggaatcatgc 
caatcttgtc 
gaggaataac 
ctgcgatcgg 
ggaaaaagag 
ttctaaacct 
accgtccatt 
tcatactgat 
atcttcgttt 
aaattgtgaa 
cgagtgtgaa 
gaaaagctat 
ggatagtggt 
tgtcgataat 
gttgggactc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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agcataatgt cgccggtgga gaatgcgaat gaaaatgtca attatgaaga atcgccgttc 1260 
tacccggagc tgacatcgac tccaatcgtc tcgaagaagc agcgggaacc tctccgagcg 1320 
aaaaagagca catctgtctc gaagcttcca cttgctccgt cgtcacagct attcaatgaa 1380 
gaatcgcgtt gtggattcag attcaatgtg ccggttcgcg aaatgatgga catatggcaa 1440 
gagtctggag ccttgtcgcc ggcaattcga gaaacacagg ctgaaaatac tgaaaaaaga 1500 
gctgagaatg cgtcgggtgt actccaatat ggatggactc cattcttcgg caatggcttc 1560 
aaLuLcyyay agcgcctcta ctacttccca t2.g 1593 

<210> 36 
<211> 530 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 36 

Met Arg lie Val Arg Thr His Arg Asp Glu Phe Leu Arg Thr Leu Cys 
1 5 . 10 15 

Leu Asn Leu Phe Cys Cys Leu Leu lie Asn Ser He Glu Lys Ser Lys 
20 25 30 

Gin lie Gin Ser Ser Ala Tyr Phe Phe Arg Asn Ser His Ser Phe Ala 
35 40 45 

He Glu Lys Phe Lys Arg Lys Gin Gin Lys Met Pro Arg Gly Leu Arg 
50 55 60 

Arg Ala Asp Leu Val Lys Arg His Arg His Ser Thr Thr Gly Asp Lys 
65 70 " 75 80 

Asp Gly Gly Val Pro Glu Val He Glv Cys Pro Val Leu Asp Pro He 
85 90 95 

He Cys Gin Cys Pro Lys Asp Glu lie Glu Leu Gly Glu Gly Val Lys 
100 105 110 

Met Thr Cys Thr Trp Glu Ser Cys Pro Tyr Ser Ser Arg Pro Leu His 
115 120 125 

His lie Cys Tyr Gin Leu Leu Glu Asp Asn Leu Val Lys Arg Leu Ala 
130 135 140 

Ser Leu Gly Ser Ala Arg Gly Trp Thr Val Pro Gin Arg Arg Asn Asn 
145 150 ' 155 160 

Leu Tro Glu Arg Lys Gly Gin Ser Leu lie Gly Lys Phe Cys Arg Cys 
165 170 175 

Arg Cvs Asp Arg Gly Gin Met Thr Arg Asp Lys Gin Ala Leu Tyr Glu 
180 185 190 

Lys Glu Lys Ala Val Glu Lys Glu Lys Lys Lys Lys Ala Lys Lys Ala 
195 200 ~ 205 

Lys Gin Leu Pro Gin Leu Gin Phe Asn Ser Lys Pro Leu Ala Ala lie 
210 215 220 

Glu Glu Lys Lys Arg Gly Asp Ala Asp Val Phe His Ser Pro Ser lie 
225 230 235 240 
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Ala Ser Ser Thr Arg His His Thr Phe Ser Thr Thr Thr Arg Ser Arg 
245 250 255 

Leu His Thr Asp Arg Ser Ala Ser Ser He Leu Thr His Thr He Gly 
260 265 270 

Arg Thr Trp Ser Glu Ser Ser Phe Ala Gly Glu Thr Asn Gly Gin Tyr 
275 280 285 

Asp Asn Asn Gin Glu Pro His Pro Ser Asn Cys Glu Cys Val Phe His 
290 295 300 

His Asp Tyr Asp Ala Asp Asp Gin He Asp Thr Asp Phe Glu Cys Glu 
305 * 310 315 320 

Ser Asn His Ser Asp Val He Val Pro Ala Pro Leu Pro Pro Leu Gin 
325 330 335 

Ala Lys Ser Tyr Ala Ala Thr He Met Arg Asn Gly Thr Pro Lys Val 
340 345 350 

Thr Asn Tyr Ser Pro Asp Ser Gly Leu Asp Gin Gin Thr Pro Arg Phe 
355 360 365 

Ser Leu Ser Ser Ser Ser Gly Gly Asp Val Asp Asn Gin His Gly Asp 
370 375 380 

Phe His Val Glu Thr Arg He Ser Glu His Leu Asn Ala Leu Gly Leu 
385 390 395 400 

Ser He Met Ser Pro Val Glu Asn Ala Asn Glu Asn Val Asn Tyr Glu 
405 410 415 

Glu Ser Pro Phe Tyr Pro Glu Leu Thr Ser Thr Pro He Val Ser Lys 
420 425 430 

Lys Gin Arg Glu Pro Leu Arg Ala Lys Lys Ser Thr Ser Val Ser Lys 
435 440 445 

Leu Pro Leu Ala Pro Ser Ser Gin Leu Phe Asn Glu Glu Ser Arg Cys 
450 455 460 

Gly Phe Arg Phe Asn Val Pro Val Arg Glu Met Met Asp He Trp Gin 
465 470 475 480 

Glu Ser Gly Ala Leu Ser Pro Ala He Arg Glu Thr Gin Ala Glu Asn 
485 490 495 

Thr Glu Lys Arg Ala Glu Asn Ala Ser Gly Val Leu Gin Tyr Gly Trp 
500 505 510 

Thr Pro Phe Phe Gly Asn Gly Phe Asn Leu Gly Glu Arg Leu Tyr Tyr 
515 * 520 525 

Phe Pro 
530 
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<210> 37 
<211> 1458 
<212> DNA 

<213> Caenorhabditis elegans 



< / l00> 37 

tctttcgcca 

gcagatttag 

gaagtaatag 

gagcttggtg 

ccacttcatc 

ctgggaagtg 

ggtcagtccc 

gacaagcagg 

aagaaagcaa 

gagaaaaagc 

catcacacat 

attttaacac 

ggtcagtacg 

gattacgacg 

gtaatagttc 

agaaacggga 

ccaaggtttt 

cacgtggaaa 

gtggagaatg 

tcgactccaa 

gtctcgaagc 

ttcagattca 

tcgccggcaa 

ggtgtactcc 

ctctactact 



tcgaaaagtt 
tcaaacgaca 
gatgcccagt 
aaggagtcaa 
acatatgcta 
cacgaggatg 
tgatcggaaa 
ctttatatga 
aacaactgcc 
gaggagacgc 
tctcgacgac 
acactattgg 
acaacaatca 
ctgacgatca 
cagctccact 
caccgaaggt 
cattgtcttc 
ctagaatttc 
cgaatgaaaa 
tcgtctcgaa 
ttccacttgc 
atgtgccggt 
ttcgagaaac 
aatatggatg 
tcccatag 



caaaagaaaa 
tcgccactca 
tttagatcct 
gatgacgtgc 
tcaactgctc 
gacagtgcca 
gttctgccga 
gaaagagaag 
ccagctacaa 
tgatgtattc 
gacacgatcg 
aagaacgtgg 
ggagccacat 
aatagatacg 
tccaccactt 
tacaaattat 
ttcgagtgga 
cgagcatctc 
tgtcaattat 
gaagcagcgg 
tccgtcgtca 
tcgcgaaatg 
acaggctgaa 
gactccattc 



caacaaaaaa 
acgacaggag 
attatctgcc 
acttgggaat 
gaggacaatc 
caacggagga 
tgtcgctgcg 
gctgtggaaa 
tttaattcta 
cactcaccgt 
cgacttcata 
tccgaatctt 
ccatcaaatt 
gatttcgagt 
caggcgaaaa 
tcaccggata 
ggagatgtcg 
aacgcgttgg 
gaagaatcgc 
gaacctctcc 
cagctattca 
atggacatat 
aatactgaaa 
ttcggcaatg 



tgcctcgcgg 
acaaagacgg 
aatgtccaaa 
catgcccgta 
ttgtcaagcg 
ataacttatg 
atcggggaca 
aagagaagaa 
aacctttggc 
ccattgcctc 
ctgatcgttc 
cgtttgccgg 
gtgaatgcgt 
gtgaaagcaa 
gctatgcagc 
gtggtctcga 
ataatcaaca 
gactcagcat 
cgttctaccc 
gagcgaaaaa 
atgaagaatc 
ggcaagagtc 
aaagagctga 
gcttcaatct 



tctacggaga 
aggagtacca 
agatgagatc 
ctctagtaga 
attagcctca 
ggagaggaag 
aatgaccaga 
gaagaaggcc 
agctatcgag 
aagtacacgg 
ggcttcttcc 
tgaaacaaat 
atttcatcac 
tcacagcgac 
gacaataatg 
tcagcaaact 
tggagacttc 
aatgtcgccg 
ggagctgaca 
gagcacatct 
gcgttgtgga 
tggagccttg 
gaatgcgtcg 
cggagagcgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1458 



<210> 38 
<211> 485 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 38 

Ser Phe Ala lie Glu Lys Phe Lys Arg Lys Gin Gin Lys Met Pro Arg 
1 5 10 15 

Gly Leu Arg Arg Ala Asp Leu Val Lys Arg His Arg His Ser Thr Thr 
20 * 25 30 

Gly Asp Lys Asp Gly Gly Val Pro Glu Val lie Gly Cys Pro Val Leu 
35 * 40 45 

Asp Pro lie lie Cys Gin Cys Pro Lys Asp Glu He Glu Leu Gly Glu 
50 55 60 

Gly Val Lys Met Thr Cys Thr Trp Glu Ser Cys Pro Tyr Ser Ser Arg 
65 70 75 80 

Pro Leu His His He Cys Tyr Gin Leu Leu Glu Asp Asn Leu Val Lys 
85 90 95 



Arg Leu Ala Ser Leu Gly Ser Ala Arg Gly Trp Thr Val Pro Gin Arg 
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Arg Asn Asn Leu 
115 

Cys Arg Cys Arg 
130 

Leu Tyr Glu Lys 
145 

Lys Lys Ala Lys 



Ala Ala He Glu 
180 

Pro Ser He Ala 
195 



Arg Ser Arg Leu 
210 



Thr He Gly Arg 
225 

Gly Gin Tyr Asp 



Val Phe His His 
260 

Glu Cys Glu Ser 
275 



Pro Leu Gin Ala 
290 

Pro Lys Val Thr 
305 



Pro Arg Phe Ser 



His Gly Asp Phe 
340 

Leu Gly Leu Ser 
355 

Asn Tyr Glu Glu 
370 

Val Ser Lys Lys 
385 

Val Ser Lys Leu 



Trp Glu 

Cys Asp 

Glu Lys 
150 

Gin Leu 
165 

Glu Lys 

Ser Ser 

His Thr 

Thr Trp 
230 

Asn Asn 
245 

Asp Tyr 
Asn His 



Lys Ser 

Asn Tyr 
310 

Leu Ser 
325 

His Val 

He Met 

Ser Pro 

Gin Arg 
390 

Pro Leu 
405 



Arg Lys 
120 

Arg Gly 
135 

Ala Val 

Pro Gin 

Lys Arg 

Thr Arg 
200 

Asp Arg 
215 

Ser Glu 

Gin Glu 

Asp Ala 

Ser Asp 
280 

Tyr Ala 
295 



Ser Pro 



Ser Ser 



Glu Thr 

Ser Pro 
360 

Phe Tyr 
375 

Glu Pro 
Ala Pro 



60 

105 

Gly Gin 

Gin Met 

Glu Lys 

Leu Gin 
170 

Gly Asp 
185 

His His 

Ser Ala 

Ser Ser 

Pro His 
250 

Asp Asp 
265 

Val He 

Ala Thr 

Asp Ser 

Ser Gly 
330 

Arg He 
345 

Val Glu 
Pro Glu 



Leu Arg 



Ser Ser 
410 



Ser Leu 



Thr Arg 
140 

Glu Lys 
155 

Phe Asn 

Ala Asp 

Thr Phe 

Ser Ser 
220 

Phe Ala 
235 

Pro Ser 

Gin lie 

Val Pro 

He Met 
300 

Gly Leu 
315 

Gly Asp 



Ser Glu 



Asn Ala 



Leu Thr 
380 

Ala Lys 
395 

Gin Leu 



110 

He Gly 
125 

Asp Lys 

Lys Lys 

Ser Lys 

Val Phe 
190 

Ser Thr 
205 

He Leu 

Gly Glu 

Asn Cys 

Asp Thr 
270 

Ala Pro 
285 

Arg Asn 
Asp Gin 



Val Asp 



His Leu 
350 



Asn Glu 
365 



Ser Thr 



Lys Ser 



Phe Asn 



Lys Phe 

Gin Ala 

Lys Ala 
160 

Pro Leu 
175 

His Ser 
Thr Thr 



Thr His 



Thr Asn 
240 

Glu Cys 
255 

Asp Phe 

Leu Pro 

Gly Thr 

Gin Thr 
320 

Asn Gin 
335 

Asn Ala 

Asn Val 

Pro He 

Thr Ser 
400 

Glu Glu 
415 
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Ser Arc Cys Gly Phe Arg Phe Asn Val Pro Val Arg Glu Met Met Asp 
420 425 430 

lie Trp Gin Glu Ser Gly Ala Leu Ser Pro Ala lie Arg Glu Thr Gin 
435 440 445 

Ala Glu Asn Thr Glu Lys Arg Ala Glu Asn Ala Ser Gly Val Leu Gin 
450 455 460 

Tyr Gly Trp Thr Pro Phe Phe Gly Asn Gly Phe Asn Leu Gly Glu Arg 
465 470 " 475 480 

Leu Tyr Tyr Phe Pro 
485 



<210> 39 
<211> 1056 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 39 

atgcaaaaca 

aacagtgcga 

aaaactcgtg 

tcgtttcgtg 

gaaatcgtca 

gaatcctgtg 

gaagtgagaa 

ttgccaccac 

ccaatgcatc 

atgatggcac 

ccacctcgat 

aaaaacggcg 

ccatgcagca 

tctatattcc 

tttggatcag 

ggattgagat 

aacagccctg 

tctgtttctc 



cacagatatt 
atccttccaa 
agccggcttc 
aacaagattt 
caaatgatcg 
agaaagctcg 
gagcgacacc 
caaagctgtg 
agttgttcaa 
cataccgtag 
gcaccaagca 
agcctacgac 
ataagtgttc 
cacatcatct 
gaaataatag 
catcagagag 
attatcttct 
caccaaaaga 



tactaacttc 
caaagatcca 
tttcacagat 
ggcggcaatg 
tggatccaaa 
tgctgcgctt 
aacccgcaga 
tgtcgatctt 
ggaaaaggag 
caatggaatt 
cagcgagctc 
ggaaacaagt 
tgattcttcg 
tcgtgaccag 
tgctaacaaa 
cgagacagtt 
cgctgctctc 
atcgtcaagc 



gctcacagag 
attttcacaa 
aacaggatat 
ttcttcgcat 
gggttcgggt 
cacgaatcac 
aagcttatca 
cgagcccctc 
aacacaacat 
ttcaacacgc 
aagctttctt 
attctgatgt 
aatcacgagc 
attactgctc 
ggaaagagag 
tcagacgaag 
tacgaaggtt 
cagtaa 



cacatgatgg 
tgccaatatc 
atgtcagcaa 
atggaagagt 
ttgtcacact 
atgttcaagg 
acaatccaca 
ataatttatg 
gttttcccga 
gtagtcttgt 
cagctggtga 
gcatgcacag 
tgtctgatgt 
ttctcgacac 
caccatctgt 
agattcattg 
ccacatcgtt 



attaccgttt 
ggtcaaaccg 
tattcccttc 
cctgagtgtg 
cgattccatc 
aagaattata 
aaatgaagtt 
gagagctgag 
agctggattc 
gcagaccaaa 
atacttctgc 
acaaaactca 
ggagttgaac 
ttcaaaccat 
gacatcttct 
gtccccacat 
ccacggaaag 
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<210> 40 
<211> 351 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 40 

Met Gin Asn Thr Gin lie Phe Thr Asn Phe Ala His Arg Ala His Asp 
15 10 15 

Gly Leu Pro Phe Asn Ser Ala Asn Pro Ser Asn Lys Asp Pro lie Phe 
20 25 30 

Thr Met Pro lie Ser Val Lys Pro Lys Thr Arg Glu Pro Ala Ser Phe 
35 40 45 



Thr Asp Asn Arg lie Tyr Val Ser Asn lie Pro Phe Ser Phe Arg Glu 
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50 55 60 

Gin Asp Leu Ala Ala Met Phe Phe Ala Tyr Gly Arg Val Leu Ser Val 
65 70 75 80 

Giu lie Val Thr Asn Asp Arg Gly Ser Lys Gly Phe Gly Phe Val Thr 
85 90 95 

Leu Asp Ser lie Glu Ser Cys Glu Lys Ala Arg Ala Ala Leu His Glu 
100 105 110 

Ser His Val Gin Gly Arg lie lie Glu Val Arg Arg Ala Thr Pro Thr 
115 120 125 

Arg Arg Lys Leu lie Asn Asn Pro Gin Asn Glu Val Leu Pro Pro Pro 
130 135 140 

Lys Leu Cys Val Asp Leu Arg Ala Pro His Asn Leu Trp Arg Ala Glu 
145 * 150 155 160 

Pro Met His Gin Leu Phe Lys Glu Lys Glu Asn Thr Thr Cys Phe Pro 
165 170 175 

Glu Ala Gly Phe Met Met Ala Pro Tyr Arg Ser Asn Gly lie Phe Asn 
180 185 190 

Thr Arg Ser Leu Val Gin Thr Lys Pro Pro Arg Cys Thr Lys His Ser 
195 200 205 

Glu Leu Lys Leu Ser Ser Ala Gly Glu Tyr Phe Cys Lys Asn Gly Glu 
210 215 220 

Pro Thr Thr Glu Thr Ser lie Leu Met Cys. Met His Arg Gin Asn Ser 
225 230 235 240 

Pro Cys Ser Asn Lys Cys Ser Asp Ser Ser Asn His Glu Leu Ser Asp 
245 250 255 

Val Glu Leu Asn Ser lie Phe Pro His His Leu Arg Asp Gin lie Thr 
260 265 270 

Ala Leu Leu Asp Thr Ser Asn His Phe Gly Ser Gly Asn Asn Ser Ala 
275 280 285 

Asn Lys Gly Lys Arg Ala Pro Ser Val Thr Ser Ser Gly Leu Arg Ser 
290 ^ 295 300 

Ser Glu Ser Glu Thr Val Ser Asp Glu Glu lie His Trp Ser Pre His 
305 310 315 320 

Asn Ser Pro Asp Tyr Leu Leu Ala Ala Leu Tyr Glu Gly Ser Thr Ser 
325 330 335 

Phe His Gly Lys Ser Val Ser Pro Pro Lys Glu Ser Ser Ser Gin 
340 345 350 



<210> 41 
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<211> 1053 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 41 
caaaacacac 



agatatttac taacttcgct cacagagcac atgatggatt 



actcgtgagc 
tttcgtgaac 
atcgtcacaa 
tcctgtgaga 
gtgagaagag 
ccaccaccaa 
atgcatcagt 
atggcaccat 
cctcgatgca 
aacggcgagc 
tgcagcaata 
atattcccac 
ggatcaggaa 
ttgagatcat 
agccctgatt 
gtttctccac 



cggcttcttt 
aagatttggc 
atgatcgtgg 
aagctcgtgc 
cgacaccaac 
agctgtgtgt 
tgttcaagga 
accgtagcaa 
ccaagcacag 
ctacgacgga 
agtgttctga 
atcatcttcg 
ataatagtgc 
cagagagcga 
atcttctcgc 
caaaagaatc 



cacagataac 
ggcaatgttc 
atccaaaggg 
tgcgcttcac 
ccgcagaaag 
cgatcttcga 
aaaggagaac 
tggaattttc 
cgagctcaag 
aacaagtatt 
ttcttcgaat 
tgaccagatt 
taacaaagga 
gacagtttca 
tgctctctac 
gtcaagccag 



aggatatatg 
ttcgcatatg 
ttcgggtttg 
gaatcacatg 
cttatcaaca 
gcccctcata 
acaacatgtt 
aacacgcgta 
ctttcttcag 
ctgatgtgca 
cacgagctgt 
actgctcttc 
aagagagcac 
gacgaagaga 
gaaggttcca 
taa 



tcagcaatat 
gaagagtcct 
tcacactcga 
ttcaaggaag 
atccacaaaa 
atttatggag 
ttcccgaagc 
gtcttgtgca 
ctggtgaata 
tgcacagaca 
ctgatgtgga 
tcgacacttc 
catctgtgac 
ttcattggtc 
catcgttcca 



accgtttaac 

tcccttctcg 
gagtgtggaa 
ttccatcgaa 
aattatagaa 
tgaagttttg 
agctgagcca 
tggattcatg 
gaccaaacca 
cttctgcaaa 
aaactcacca 
gttgaactct 
aaaccatttt 
atcttctgga 
cccacataac 
cggaaagtct 
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<210> 42 
<211> 350 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 42 

Gin Asn Thr Gin He Phe Thr Asn Phe Ala His Arg Ala His Asp Gly 
15 10 15 

Leu Pro Phe Asn Ser Ala Asn Pro Ser Asn Lys Asp Pro He Phe Thr 
20 25 30 

Met Pro He Ser Val Lys Pro Lys Thr Arg Glu Pro Ala Ser Phe Thr 
35 40 45 

Asp Asn Arg He Tyr Val Ser Asn He Pro Phe Ser Phe Arg Glu Gin 
50 ~ " 55 60 

Asp Leu Ala Ala Met Phe Phe Ala Tyr Gly Arg Val Leu Ser Val Glu 
65 70 75 80 

He Val Thr Asn Asp Arg Gly Ser Lys Gly Phe Gly Phe Val Thr Leu 
85 90 95 

Asp Ser He Glu Ser Cys Glu Lys Ala Arg Ala Ala Leu His Glu Ser 
100 ' 105 HO 

His Val Gin Gly Arg He He Glu Val Arg Arg Ala Thr Pro Thr Arg 
115 120 125 

Arg Lys Leu He Asn Asn Pro Gin Asn Glu Val Leu Pro Pro Pro Lys 
130 135 140 

Leu Cys Val Asp Leu Arg Ala Pro His Asn Leu Trp Arg Ala Glu Pro 
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145 150 155 160 

Met His Gin Leu Phe Lys Glu Lys Glu Asn Thr Thr Cys Phe . Pro Glu 
165 " 170 175 

Ala Gly Phe Met Met Ala Pro Tyr Arg Ser Asn Gly He Phe Asn Thr 
180 185 190 

Arg Ser Leu Val Gin Thr Lys Pro Pro Arg Cys Thr Lys His Ser Glu 
195 200 205 

Leu Lys Leu Ser Ser Ala Gly Glu Tyr Phe Cys Lys Asn Gly Glu Pro 
210 215 220 

Thr Thr Glu Thr Ser He Leu Met Cys Met His Arg Gin Asn Ser Pro 
225 230 235 240 

Cys Ser Asn Lys Cys Ser Asp Ser Ser Asn His Glu Leu Ser Asp Val 
245 250 255 

Glu Leu Asn Ser He Phe Pro His His Leu Arg Asp Gin He Thr Ala 
260 265 270 

Leu Leu Asp Thr Ser Asn His Phe Gly Ser Gly Asn Asn Ser Ala Asn 
275 280 285 

Lys Gly Lys Arg Ala Pro Ser Val Thr Ser Ser Gly Leu Arg Ser Ser 
290 ' 295 300 

Glu Ser Glu Thr Val Ser Asp Glu Glu He His Trp Ser Pro His Asn 
305 310 " 315 320 

Ser Pro Asp Tyr Leu Leu Ala Ala Leu Tyr Glu Gly Ser Thr Ser Phe 
325 330 335 

His Gly Lys Ser Val Ser Pro Pro Lys Glu Ser Ser Ser Gin 
340 345 350 



<210> 43 
<211> 1349 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 43 

aagtttgagg taacaaaaat gccgccgcca gctccgttcg aggggcaaaa gaacaaaggg 60 
cacaatgtcc acaaaactgg aggaaaagca anaaagcgaa atgagaagga gccgagagtc 120 
aagggaaaca atctgaaagc tttcactttc cactccgccg tatccgctgg aaaagcgatt 180 
cgacgagctg cagatctcaa tgaaaagaag aaacatgttc tgatgatgga cagaaaaccc 240 
atcgaaacac caccaatcat tgtagcaatc gttggaccga gtaaagtcgg aaaaacgaca 300 
cttctccggg gtcttgtcaa gtattacctc cgtgatggat tcggagagat caatggtcca 360 
gtgacaattg taactggaaa gaaacgtcgt gtacagttca ttgaggtcaa aaacgatatt 420 
aatcatatga ttgatatcgc gaaagtcgca gatttggtgc ttctaatggt cgatgcatcg 480 
tatggatttg aaatggaaac ctttgaattt ctaaatattt gccaagtgca cggaatgcca 540 
cgtattatgg gagtattgaa tcatttggat cttctcgatg gaatctcacg tgtcaataag 600 
accaagaaaa ttctgaaaca tcgtttctgg acggagctct accagggcgc gaagcttttc 660 
tacatgactg gaatgatgca tggacagtat aaatataatg agatccataa cctctgcaga 720 
ttcatttctg tcatgaaatt ccgtccgatg gtgtggaaag atgctcatcc atacgttctt 780 
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tgtgazcgtt tcgaagacat taccaacgtc gaaactcttc gaacggatcc actcatcgat 840 
cgacacattg ccatgtatgg atgggttcat ggtgctcatt tgaagaatca ttcgtcgatt 900 
catgtgccag gtgttggtga tatgaggatc agtaatgtca cgagtctacc cgatccgtgt 960 
ccgctgcctg atgagattaa gaaacgagcg ttgaatgaga aagagcggaa agtgtatgct 1020 
ccgttttctg gattaggagg tgtcatttat gataaggatg cgatttatat tgagtcaaag 1080 
aatgcrcaca attttaatag aaaacgcgac ggactcgttg aagctctcga aggcgtcaag 1140 
t<z*nn~?icciq atgataaatt gaaqaaatcc tctctgcaac ttctcggtga ttcagtagca 1200 
cttgatattg atcaggaaag tgattggcca gagcctggag aagaagatga agaagatctg 1260 
gatgaggagg attttcagga tgaagaagaa gatgaagatg aggatgagga tgaggaagat 1320 
gttggtgtcg tgaaaaagga aggtgtact 1349 

<210> 44 
<211> 449 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 44 

Lys Phe Glu Val Thr Lys Met Pro Pro Pro Ala Pro Phe Glu Gly Gin 
1 5 10 15 

Lys Asn Lys Gly His Asn Val His Lys Thr Gly Gly Lys Ala Xaa Lys 
20 25 30 

Arg Asn Glu Lys Glu Pro Arg Val Lys Gly Asn Asn Leu Lys Ala Phe 
35 40 45 

Thr Phe His Ser Ala Val Ser Ala Gly Lys Ala lie Arg Arg Ala Ala 
50 55 60 

Asp Leu Asn Glu Lys Lys Lys His Val Leu Met Met Asp Arg Lys Pro 
65 70 75 80 

He Glu Thr Pro Pro He He Val Ala He Val Gly Pro Ser Lys Val 
85 90 95 

Gly Lys Thr Thr Leu Leu Arg Gly Leu Val Lys Tyr Tyr Leu Arg Asp 
100 105 110 

Gly Phe Gly Glu He Asn Gly Pro Val Thr He Val Thr Gly Lys Lys 
115 120 125 

Arg Arg Val Gin Phe lie Glu Val Lys Asn Asp He Asn His Met lie 
130 135 140 

Asp lie Ala Lys Val Ala Asp Leu Val Leu Leu Met Val Asp Ala Ser 
145 150 * 155 160 

Tyr Gly Phe Glu Met Glu Thr Phe Glu Phe Leu Asn lie Cys Gin Val 
165 170 175 

His Gly Met Pro Arg lie Met Gly Val Leu Asn His Leu Asp Leu Leu 
180 185 190 

Asp Gly lie Ser Arg Val Asn Lys Thr Lvs Lys He Leu Lys His Arg 
195 200 205 

Phe Trp Thr Glu Leu Tyr Gin Gly Ala Lys Leu Phe Tyr Met Thr Gly 
210 215 220 
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Met Met His Gly Gin Tyr Lys Tyr Asn Glu lie His Asn Leu Cys Arg 
225 " 230 " 235 240 

Phe lie Ser Val Met Lys Phe Arg Pro Met Val Trp Lys Asp Ala His 
245 250 255 

Pro Tyr Val Leu Cys Asp Arg Phe Glu Asp lie Thr Asn Val Glu Thr 
260 265 270 

Leu Arg Thr Asp Pro Leu lie Asp Arg His lie Ala Met Tyr Gly Trp 
275 280 285 

Val His Gly Ala His Leu Lys Asn His Ser Ser He His Val Pro Gly 
290 295 300 

Val Gly Asp Met Arg He Ser Asn Val Thr Ser Leu Pro Asp Pro Cys 
305 310 315 320 

Pro Leu Pro Asp Glu He Lys Lys Arg Ala Leu Asn Glu Lys Glu Arg 
325 " 330 ~ 335 

Lys Val Tyr Ala Pro Phe Ser Gly Leu Gly Gly Val He Tyr Asp Lys 
340 345 350 

Asp Ala He Tyr He Glu Ser Lys Asn Ala His Asn Phe Asn Arg Lys 
355 360 365 

Arg Asp Gly Leu Val Glu Ala Leu Glu Gly Val Lys Ser Gly Thr Asp 
370 375 380 

Asp Lys Leu Lys Lys Ser Ser Leu Gin Leu Leu Gly Asp Ser Val Ala 
385 390 395 400 

Leu Asp He Asp Gin Glu Ser Asp Trp Pro Glu Pro Gly Glu Glu Asp 
405 410 415 

Glu Glu Asp Leu Asp Glu Glu Asp Phe Gin Asp Glu Glu Glu Asp Glu 
420 425 430 

Asp Glu Asp Glu Asp Glu Glu Asp Val Gly Val Val Lys Lys Glu Gly 
435 440 445 

Val 



<210> 45 
<211> 3423 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 45 

atgtcgtacc gcgtgttttc acgcgtcgat ccccatgtgc catcgacatc cacagctcca 60 
cagcgtcgtt ttcaagaaaa cttgcaaaaa ttcaagcgat tttttccgcc aaacgcacca 120 
ccaacactct gggaagtttg ctcctcgaaa caacgcagc.a aatcattgaa aaacacgttt 180 
caaacggaag tacgtgcact acgaggactt aattttacag tattgctgaa tccgtacaaa 240 
aactatctca atgatctcac aaatctatcc ggtttcacct tcgatgatct ttgtcaagca 300 
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cttcgattct 
aacgaattat 
tccatcgata 
cttaagaatc 
caagatgatg 
gtaccaattc 
gaagatgaac 
ttcaatgggc 
tccaaagatc 
acacatatca 
attgaagatg 
aattatcaaa 
ggacgtacaa 
tttcatatga 
ggatggagag 
catattatgt 
tttgataagt 
gaattgttga 
aatagatatg 
gagaataatc 
gaggacggaa 
aaagtacgaa 
tcttcatcaa 
atgattaatc 
caactcaaaa 
ttgaagtcgg 
agttgggatg 
gagaaaatgg 
gaatatgcac 
gatcatccac 
agacggaaat 
tccgatcctg 
attgttccag 
acgattgcaa 
gcgaacaaaa 
attttaaatg 
agaaatgcat 
ccgaaaccaa 
gcagccttct 
cctccagttg 
cgagcaatag 
ccgtttaata 
ggtgttttgg 
acacaatcag 
gctcctgttc 
gtgccatgga 
acggctgaaa 
atgccaactc 
gcaactccag 
ggcgattcaa 
cctggaacaa 
tttcaagagg 
tga 



ttgcatttta 
ttcgattaat 
aatcaacact 
tgaaacaaga 
agctaccttc 
ctccgcctcc 
atttacaatc 
catttgagag 
gacgattgag 
gttttagatt 
atggatattt 
ctgctcttca 
tcaagccggg 
tgggacgaca 
aatcaaatgg 
ttccattatt 
acagcttgta 
atgctggaga 
atatgagcac 
gagttgtgga 
gaaggagaaa 
cacctcgaaa 
ctccaaaaca 
aagtgccacc 
agacgccaac 
aatctgtccc 
atcatgttgg 
aagattcaga 
ctgaacaata 
ctcaaccaca 
ccggtcagaa 
caccgcctgt 
ttcatcctgt 
agacaacagc 
ctgtgtatcc 
gaaatactgc 
atacattttc 
ctcaaaatgg 
atagtgagag 
ccacatcatc 
ctcatcagaa 
ttcaaatggc 
ctcattcaaa 
actttcggcc 
caaatgcaat 
tcgcaaatag 
agatgtcttc 
ctcctcctcc 
cattggtgga 
atcagtcaaa 
cactgacacc 
atggtgatga 



tagaaaacag 
tgcaagttgc 
agtggatacg 
atcatctgaa 
tgttccaata 
ttgtagaaat 
aatgcacctg 
aactatcaaa 
cgatgtggaa 
agattctgtg 
ctccaaagaa 
tgaatgcaca 
attcgttgaa 
tattcatgga 
aaaatgggtc 
cgtaaaatgg 
tggattaaga 
gaacttattt 
tacgaaagga 
caagcatatg 
gaggaaaccg 
agttcttgag 
acgcgccctg 
agttgtaaat 
ttcggctaca 
agctccacca 
taattcgatg 
aggaagacag 
tgcgcgagat 
acaaccacca 
aactgatcaa 
tcctgctgct 
acctttgcca 
acgagtactg 
tgtcaagaag 
aactgcaagt 
attacagcaa 
aattgaaaca 
tcatcctaca 
tacttttgta 
gcaaatgctc 
attcaaacca 
ttcaacaata 
gagaagcttc 
aaagagaaga 
cagggcgttg 
gagcccgtta 
ggctccggct 
tacaatatcg 
tcctccttta 
tgaagaaatc 
agggccaaca 
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ccagttttga 
atcatttatt 
ctgtcaatga 
caaaaagatc 
caaattggca 
gatttttcga 
tggaatcatg 
atgatgtcta 
tattacggag 
atgcaattga 
tgtctggata 
actgctttcg 
gatggcctaa 
tttttccaag 
ccacggcgat 
gaactttggc 
ctaatgtcaa 
tcgaggtatc 
cggaatcagt 
cacagttcgg 
aaggatgaag 
ccacctctat 
cttgttcaaa 
acaccaccat 
aaacgtcatt 
ccaaccaaga 
gaagaagaac 
aatgtttggg 
attatcgaac 
cctccactac 
acgactccat 
ccagtggctc 
aacggaagtg 
tattccatta 
ttgacacctt 
ccatcgaaaa 
aaagctccat 
cctccgacag 
cgaagtcccc 
cctgctgcaa 
ttaaatacgg 
cgtcgttggg 
ccatatgttc 
agtcaaaata 
gaagtaggga 
gtggcagcgg 
ctatcgtcac 
cctgctcaag 
gctggttcaa 
cggacgtaca 
ggcgacgcga 
gtgaaaagct 



agtcaaatat 
caaatgataa 
acattttgga 
caatataccc 
gattgaaaga 
tgcgacaatt 
ttgggtgtaa 
agaataatgt 
acaatgaaga 
ttaactttga 
gtgcatggta 
cagttatccg 
ctacagacga 
tatggaggga 
atttggtgga 
cgtcaacatt 
tgattagaag 
catcgcattt 
atttatcagc 
cttacaagct 
ctctactcgg 
ttgctccaac 
aagagaattt 
ccccacaaca 
taccggaaat 
aaatgtcaat 
atgtagatga 
ttccacagga 
attatattcc 
caacgccgaa 
catcagacgc 
ctgttgttcc 
taaatactcc 
aacctcaaat 
ctgtagttcc 
atgcagcatc 
attatccagc 
gagcacaatc 
tcgtcccgta 
caataccatc 
aaacgtgtcg 
atcctcttcc 
agcgagtccc 
gtgttgcttc 
atttaaaatc 
cgatggcaac 
aggctccgat 
catctgcaca 
cgactacaga 
catcacacat 
ttcgaactga 
tcctaatgaa 



ggaagatgct 
ctggagggcg 
gaagcagagg 
accactattc 
cagagaaaaa 
taatccgttg 
tgatgcaaaa 
tgcgattcgc 
tttaccatcc 
tttccccaaa 
tctctacgag 
acctccatct 
atgttcagaa 
agaggatcgt 
tatctacaat 
gaaatgggct 
acatccacaa 
gttggaatcg 
aattcaaatg 
tcttatcgaa 
agttgctgca 
tcgatttatt 
ggagaaaaca 
gactgcttca 
tgagcaagaa 
tatagctgac 
gaaggattcg 
tcgaggaaaa 
tgctgctcga 
gcctccacga 
cgaagcttca 
aattgttcca 
aaaagtgaag 
accaccaatt 
gtctccaatg 
tgtagttgtc 
tggaatgcgt 
tttaatgcgt 
tggttttgtt 
acctgctagc 
acgcgttatg 
gaaatcttct 
caacaactca 
cccggctcct 
tcggcagtat 
gatggaggag 
gacaactttg 
atcaacatca 
aacaactaca 
tcgaaaaact 
aagtcaaaga 
catctacaaa 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3423 



<210> 46 

<211> 1140 

<212> PRT 

<213> Caenorhabditis elegans 
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<400> 46 

Met Ser Tyr Arg Val Phe Ser Arg Val Asp Pro His Val Pro Ser Thr 
15 10 15 

Ser Thr Ala Pro Gin Arg Arg Phe Gin Glu Asn Leu Gin Lys Phe Lys 
20 25 30 

Arg Phe Phe Pro Pro Asn Ala Pro Pro Thr Leu Trp Glu Val Cys Ser 
35 40 45 

Ser Lys Gin Arg Ser Lys Ser Leu Lys Asn Thr Phe Gin Thr Glu Val 
50 55 60 

Arg Ala Leu Arg Gly Leu Asn Phe Thr Val Leu Leu Asn Pro Tyr Lys 
65 70 75 80 

Asn Tyr Leu Asn Asp Leu Thr Asn Leu Ser Gly Phe Thr Phe Asp Asp 

85 90 95 

Leu Cys Gin Ala Leu Arg Phe Phe Ala Phe Tyr Arg Lys Gin Pro Val 
100 105 110 

Leu Lys Ser Asn Met Glu Asp Ala Asn Glu Leu Phe Arg Leu lie Ala 
115 120 125 

Ser Cys lie lie Tyr Ser Asn Asp Asn Trp Arg Ala Ser lie Asp Lys 
130 135 140 

Ser Thr Leu Val Asp Thr Leu Ser Met Asn He Leu Glu Lys Gin Arg 
145 150 155 160 

Leu Lys Asn Leu Lys Gin Glu Ser Ser Glu Gin Lys Asp Pro He Tyr 
165 170 175 

Pro Pro Leu Phe Gin Asp Asp Glu Leu Pro Ser Val Pro He Gin He 
180 185 190 

Gly Arg Leu Lys Asp Arg Glu Lys Val Pro He Pro Pro Pro Pro Cys 
195 200 205 

Arg Asn Asp Phe Ser Met Arg Gin Phe Asn Pro Leu Glu Asp Glu His 
210 * 215 220 

Leu Arg Ser Met His Leu Trp Asn His Val Gly Cys Asn Asp Ala Lys 
225 230 235 240 

Phe Asn Gly Pro Phe Glu Arg Thr He Lys Met Met Ser Lys Asn Asn 
245 250 255 

Val Ala He Arg Ser Lys Asp Arg Arg Leu Ser Asp Val Glu Tyr Tyr 
260 265 270 

Gly Asp Asn Glu Asp Leu Pro Ser Thr His He Ser Phe Arg Leu Asp 
275 280 285 

Ser Val Met Gin Leu lie Asn Phe Asp Phe Pro Lys He Glu Asp Asp 
290 295 300 
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Gly Tyr Phe Ser Lys Glu Cys Leu Asp Ser Ala Trp Tyr Leu Tyr Glu 
305 310 315 320 

Asn Tyr Gin Thr Ala Leu His Glu Cys Thr Thr Ala Phe Ala Val He 
325 330 335 

Arg Pro Pro Ser Gly Arg Thr He Lys Pro Gly Phe Val Glu Asp Gly 
340 345 350 

Leu Thr Thr Asp Glu Cys Ser Glu Phe His Met Met Gly Arg His He 
355 360 365 

His Gly Phe Phe Gin Val Trp Arg Glu Glu Asp Arg Gly Trp Arg Glu 
370 375 380 

Ser Asn Gly Lys Trp Val Pro Arg Arg Tyr Leu Val Asp He Tyr Asn 
385 390 395 400 

His lie Met Phe Pro Leu Phe Val Lys Trp Glu Leu Trp Pro Ser Thr 
405 410 415 

Leu Lys Trp Ala Phe Asp Lys Tyr Ser Leu Tyr Gly Leu Arg Leu Met 
420 . 425 430 

Ser Met He Arg Arg His Pro Gin Glu Leu Leu Asn Ala Gly Glu Asn 
435 440 445 

Leu Phe Ser Arg Tyr Pro Ser His Leu Leu Glu Ser Asn Arg Tyr Asp 
450 455 460 

Met Ser Thr Thr Lys Gly Arg Asn Gin Tyr Leu Ser Ala He Gin Met 
465 470 475 480 

Glu Asn Asn Arg Val Val Asp Lys His Met His Ser Ser Ala Tyr Lys 
485 490 495 

Leu Leu He Glu Glu Asp Gly Arg Arg Arg Lys Arg Lys Pro Lys Asp 
500 505 510 

Glu Ala Leu Leu Gly Val Ala Ala Lys Val Arg Thr Pro Arg Lys Val 
515 520 525 

Leu Glu Pro Pro Leu Phe Ala Pro Thr Arg Phe He Ser Ser Ser Thr 
530 535 540 

Pro Lys Gin Arg Ala Leu Leu Val Gin Lys Glu Asn Leu Glu Lys Thr 
545 550 555 560 

Met lie Asn Gin Val Pro Pro Val Val Asn Thr Pro Pro Ser Pro Gin 
565 570 575 

Gin Thr Ala Ser Gin Leu Lys Lys Thr Pro Thr Ser Ala Thr Lys Arg 
580 585 590 



His Leu Pro Glu He Glu Gin Glu Leu Lys Ser Glu Ser Val Pro Ala 
595 600 605 
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Pro Pro Pro Thr Lvs Lvs Met Ser He lie Ala Asp Ser Trp Asp Asp 
610 ' 615 620 

His Val Gly Asn Ser Met Glu Glu Glu His Val Asp Glu Lys Asp Ser 
625 630 635 6«U 

Glu Lvs Met Glu Asp Ser Glu Gly Arg Gin Asn Val Trp Val Pro Gin 
645 650 655 

Asp Arg Gly Lys Glu Tyr Ala Pro Glu Gin Tyr Ala Arg Asp He He 
660 665 670 

Glu His Tyr He Pro Ala Ala Arg Asp His Pro Pro Gin Pro Gin Gin 
675 680 685 

Pro Pro Pro Pro Leu Pro Thr Pro Lys Pro Pro Arg Arg Arg Lys Ser 
690 695 700 

Gly Gin Lys Thr Asp Gin Thr Thr Pro Ser Ser Asp Ala Glu Ala Ser 
705 710 715 720 

Ser Asp Pro Ala Pro Pro Val Pro Ala Ala Pro Val Ala Pro Val Val 
725 730 735 

Pro He Val Pro He Val Pro Val His Pro Val Pro Leu Pro Asn Gly 
740 745 750 

Ser Val Asn Thr Pro Lys Val Lys Thr He Ala Lys Thr Thr Ala Arg 
755 760 765 

Val Leu Tyr Ser He Lys Pro Gin He Pro Pro He Ala Asn Lys Thr 
770 775 780 

Val Tyr Pro Val Lys Lys Leu Thr Pro Ser Val Val Pro Ser Pro Met 
785 790 795 800 

He Leu Asn Gly Asn Thr Ala Thr Ala Ser Pro Ser Lys Asn Ala Ala 
805 810 815 

Ser Val Val Val Arg Asn Ala Tyr Thr Phe Ser Leu Gin Gin Lys Ala 
820 " 825 830 

Pro Tyr Tyr Pro Ala Gly Met Arg Pro Lys Pro Thr Gin Asn Gly He 
835 840 845 

Glu Th- Pro Pro Thr Gly Ala Gin Ser Leu Met Arg Ala Ala Phe Tyr 
850 855 860 

Ser Glu Ser His Pro Thr Arg Ser Pro Leu Val Pro Tyr Gly Phe Val 
865 870 875 880 

Pro Pro Val Ala Thr Ser Ser Thr Phe Val Pro Ala Ala Thr lie Pro 
885 890 895 

Ser Pro Ala Ser Arg Ala He Ala His Gin Lys Gin Met Leu Leu Asn 
900 905 910 

Thr Glu Thr Cys Arg Arg Val Met Pro Phe Asn He Gin Met Ala Phe 
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915 920 925 

Lys Pro Arg Arg Trp Asp Pro Leu Pro Lys Ser Ser Gly Val Leu Ala 
930 935 940 

His Ser Asn Ser Thr lie Pro Tyr Val Gin Arg Val Pro Asn Asn Ser 
945 950 955 960 

Thr Gin Ser Asp Phe Arg Pro Arg Ser Phe Ser Gin Asn Ser Val Ala 
965 970 975 

Ser Pro Ala Pro Ala Pro Val Pro Asn Ala lie Lys Arg Arg Glu Val 
980 985 990 

Gly Asn Leu Lys Ser Arg Gin Tyr Val Pro Trp lie Ala Asn Ser Arg 
995 1000 1005 

Ala Leu Val Ala Ala Ala Met Ala Thr Met Glu Glu Thr Ala Glu Lys 
1010 1015 1020 

Met Ser Ser Ser Pro Leu Leu Ser Ser Gin Ala Pro Met Thr Thr Leu 
1025 1030 1035 1040 

Met Pro Thr Pro Pro Pro Pro Ala Pro Ala Pro Ala Gin Ala Ser Ala 
1045 1050. 1055 

Gin Ser Thr Ser Ala Thr Pro Ala Leu Val Asp Thr He Ser Ala Gly 
1060 1065 1070 

Ser Thr Thr Thr Glu Thr Thr Thr Gly Asp Ser Asn Gin Ser Asn Pro 
1075 1080 1085 

Pro Leu Arg Thr Tyr Thr Ser His He Arg Lys Thr Pro Gly Thr Thr 
1090 1095 1100 

Leu Thr Pro Glu Glu He Gly Asp Ala He Arg Thr Glu Ser Gin Arg 
1105 1110 " 1115 1120 

Phe Gin Glu Asp Gly Asp Glu Gly Pro Thr Val Lys Ser Phe Leu Met 
1125 1130 1135 

Asn He Tyr Lys 
1140 



<210> 47 
<211> 1644 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 47 

ttaccggaaa 

aaaatgtcaa 

catgtagatg 

gttccacagg 

cattatattc 

ccaacgccga 

tcatcagacg 



ttgagcaaga 
ttatagctga 
agaaggattc 
atcgaggaaa 
ctgctgctcg 
agcctccacg 
ccgaagcttc 



attgaagtcg 
cagttgggat 
ggagaaaatg 
agaatatgca 
agatcatcca 
aagacggaaa 
atccgatcct 



gaatctgtcc 
gatcatgttg 
gaagattcag 
cctgaacaat 
cctcaaccac 
tccggtcaga 
gcaccgcctg 



cagctccacc 
gtaattcgat 
aaggaagaca 
atgcgcgaga 
aacaaccacc 
aaactgatca 
ttcctgctgc 



accaaccaag 60 
ggaagaagaa 120 
gaatgtttgg 180 
tattatcgaa 240 
acctccacta 300 
aacgactcca 360 
tccagtggct 420 
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cctgttgttc 
gtaaatactc 
aaacctcaaa 
tctgtagttc 
aatgcagcat 
tattatccag 
ggagcacaat 
ctcgtcccgt 
acaataccat 
gaaacgtgtc 
gatcctcttc 
cagcgagtcc 
agtgttgctt 
aatttaaaat 
gcgatggcaa 
caggctccga 
gcatctgcac 
acgactacag 
acatcacaca 
attcgaactg 
ttcctaatga 



caattgttcc 
caaaagtgaa 
taccaccaat 
cgtctccaat 
ctgtagttgt 
ctggaatgcg 
ctttaatgcg 
atggttttgt 
cacctgctag 
gacgcgttat 
cgaaatcttc 
ccaacaactc 
ccccggctcc 
ctcggcagta 
cgatggagga 
tgacaacttt 
aatcaacatc 
aaacaactac 
ttcgaaaaac 
aaagtcaaag 
acatctacaa 



aattgttcca 
gacgattgca 
tgcgaacaaa 
gattttaaat 
cagaaatgca 
tccgaaacca 
tgcagccttc 
tcctccagtt 
ccgagcaata 
gccgtttaat 
tggtgttttg 
aacacaatca 
tgctcctgtt 
tgtgccatgg 
gacggctgaa 
gatgccaact 
agcaactcca 
aggcgattca 
tcctggaaca 
atttcaagag 
atga 



72 

gttcatcctg 
aagacaacag 
actgtgtatc 
ggaaatactg 
tatacatttt 
actcaaaatg 
tatagtgaga 
gccacatcat 
gctcatcaga 
attcaaatgg 
gctcattcaa 
gactttcggc 
ccaaatgcaa 
atcgcaaata 
aagatgtctt 
cctcctcctc 
gcattggtgg 
aatcagtcaa 
acactgacac 
gatggtgatg 



tacctttgcc 
cacgagtact 
ctgtcaagaa 
caactgcaag 
cattacagca 
gaattgaaac 
gtcatcctac 
ctacttttgt 
agcaaatgct 
cattcaaacc 
attcaacaat 
cgagaagctt 
taaagagaag 
gcagggcgtt 
cgagcccgtt 
cggctccggc 
atacaatatc 
atcctccttt 
ctgaagaaat 
aagggccaac 



aaacggaagt 
gtattccatt 
gttgacacct 
tccatcgaaa 
aaaagctcca 
acctccgaca 
acgaagtccc 
acctgctgca 
cttaaatacg 
acgtcgttgg 
accatatgtt 
cagtcaaaat 
agaagtaggg 
ggtggcagcg 
actatcgtca 
tcctgctcaa 
ggctggttca 
acggacgtac 
cggcgacgcg 
agtgaaaagc 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1644 



<210> 48 
<211> 547 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 48 

Leu Pro Glu He Glu Gin Glu Leu Lys Ser Glu Ser Val Pro Ala Pro 
15 10 15 



Pro Pro 



Val Gly 



Thr Lys 
20 

Asn Ser 
35 



Glu Asp 
Lys Glu 
His Tyr He Pro 



Lys Met 
50 

Arg Gly 
65 



Lys Met Ser He 



Met Glu Glu Glu 
40 

Ser Glu Gly Arg 
55 

Tyr Ala Pro Glu 
70 

Ala Ala Arg Asp 
85 



He Ala Asp Ser 
25 

His Val Asp Glu 



Gin Asn Val Trp 
60 

Gin Tyr Ala Arg 
75 

His Pro Pro Gin 
90 



Trp Asp Asp His 
30 

Lys Asp Ser Glu 
45 

Val Pro Gin Asp 



Asp He He Glu 
80 

Pro Gin Gin Pro 
95 



Pro Pro 



Gin Lys 



Asp Pro 
130 

He Val 
145 



Pro Leu 
100 

Thr Asp 
115 

Ala Pro 
Pro He 



Pro Thr Pro Lys 



Gin Thr Thr Pro 
120 

Pro Val Pro Ala 
135 

Val Pro Val His 
150 



Pro Pro Arg Arg 
105 

Ser Ser Asp Ala 



Ala Pro Val Ala 
140 

Pro Val Pro Leu 
155 



Arg Lys Ser Gly 
110 

Glu Ala Ser Ser 
125 

Pro Val Val Pro 



Pro Asn Gly Ser 
160 
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Val Asn Thr Pro Lys Val Lys Thr He Ala Lys Thr Thr Ala Arg Val 
165 170 175 

Leu Tyr Ser He Lys Pro Gin He Pro Pro He Ala Asn Lys Thr Val 
180 185 190 

Tyr Pre Val Lys Lys Leu T*r Pro Ser Val Val Pro Ser Pro Met He 
195 200 205 

Leu Asn Gly Asn Thr Ala Thr Ala Ser Pro Ser Lys Asn Ala Ala Ser 
210 " 215 220 

Val Val Val Arg Asn Ala Tyr Thr Phe Ser Leu Gin Gin Lys Ala Pro 
225 230 " 235 240 

Tyr Tyr Pro Ala Gly Met Arg Pro Lys Pro Thr Gin Asn Gly He Glu 
245 250 255 

Thr Pro Pro Thr Gly Ala Gin Ser Leu Met Arg Ala Ala Phe Tyr Ser 
260 ~ 265 270 

Glu Ser His Pro Thr Arg Ser Pro Leu Val Pro Tyr Gly Phe Val Pro 
275 280 285 

Pro Val Ala Thr Ser Ser Thr Phe Val Pro Ala Ala Thr lie Pro Ser 
290 295 300 

Pro Ala Ser Arg Ala He Ala His Gin Lys Gin Met Leu Leu Asn Thr 
305 310 315 320 

Glu Thr Cys Arg Arg Val Met Pro Phe Asn lie Gin Met Ala Phe Lys 
325 330 335 

Pro Arg Arg Trp Asp Pro Leu Pro Lys Ser Ser Gly Val Leu Ala His 
340 345 350 

Ser Asn Ser Thr He Pro Tyr Val Gin Arg Val Pro Asn Asn Ser Thr 
355 ~ 360 365 

Gin Ser Asp Phe Arg Pro Arg Ser Phe Ser Gin Asn Ser Val Ala Ser 
370 375 380 

Pro Ala Pro Ala Pro Val Pro Asn Ala He Lys Arg Arg Glu Val Gly 
385 390 395 400 

Asn Leu Lys Ser Arg Gin Tyr Val Pro Trp lie Ala Asn Ser Arg Ala 
405 410 415 

Leu Val Ala Ala Ala Met Ala Thr Met Glu Glu Thr Ala Glu Lys Met 
420 425 430 

Ser Ser Ser Pro Leu Leu Ser Ser Gin Ala Pro Met Thr Thr Leu Met 
435 440 445 

Pro Thr Pro Pro Pro Pro Ala Pro Ala Pro Ala Gin Ala Ser Ala Gin 
450 455 460 



Ser Thr Ser Ala Thr Pro Ala Leu Val Asp Thr He Ser Ala Gly Ser 
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Thr Thr Thr Glu Thr Thr Thr Gly Asp 
485 

Leu Arg Thr Tyr Thr Ser His lie Arg 
500 505 



Thr Pro Glu Glu He Gly Asp Ala He 
515 520 

Gin Glu Asp Gly Asp Glu Gly Pro Thr 
530 535 

He Tyr Lys 
545 
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475 480 

Ser Asn Gin Ser Asn Pro Pro 
490 495 

Lys Thr Pro Gly Thr Thr Leu 
510 

Arg Thr Glu Ser Gin Arg Phe 
525 

Val Lys Ser Phe Leu Met Asn 
540 



<210> 49 
<211> 1248 
<212> DNA 

<213> Homo sapiens 
<400> 49 

atgcagatgt cctacgccat ccggtgcgcc ttctaccagc tgctgctggc cgcgctcatg 60 
ctggtggcga tgctgcagct gctctacctg tcgctgctgt ccggactgca cgggcaggag 120 
gagcaagacc aatattttga gttctttccc ccgtccccac ggtccgtgga ccaggtcaag 180 
gcgcagctcc gcaccgcgct ggcctctgga ggcgtcctgg acgctagcgg cgattaccgc 240 
gtctacaggg gcctgctgaa gaccaccatg gaccccaacg atgtgatcct ggccacgcac 300 
gccagcgtgg acaacctgct gcacctgtcg ggtctgctgg agcgctggga gggcccgctg 360 
tccgtgtcgg tgttcgcggc caccaaggag gaggcgcagc tggccacggt gctggcctac 420 
gcgctgagca gccactgccc cgacatgcgc gccagggtcg ccatgcacct cgtgtgcccc 480 
tcgcgttacg aggcagccgt gcccgacccc cgggagccgg gggagtttgc cctgctgcgg 540 
tcctgccagg aggtctttga caagctagcc agggtggccc agcccgggat taattatgcg 600 
ctgggcacca atgtctccta ccccaataac ctgctgagga atctggctcg tgagggggcc 660 
aactatgccc tggtgatcga tgtggacatg gtgcccagcg aggggctgtg gagaggcctg 720 
cgggaaatgc tggatcagag caaccagtgg ggaggcaccg cgctggtggt gcctgccttc 780 
gaaatccgaa gagcccgccg catgcccatg aacaaaaacg agctggtgca gctctaccag 840 
gttggcgagg tgcggccctt ctattatggg ttgtgcaccc cctgccaggc acccaccaac 900 
tattcccgct gggtcaacct gccggaagag agcttgctgc ggcccgccta cgtggtacct 960 
tggcaggacc cctgggagcc attctacgtg gcaggaggca aggtgcccac cttcgacgag 1020 
cgctttcggc agtacggctt caaccgaatc agccaggcct gcgagctgca tgtggcgggg 1080 
tttgattttg aggtcctgaa cgaaggtttc ttggttcata agggcttcaa agaagcgttg 1140 
aagttccatc cccaaaagga ggctgaaaat cagcacaata agatcctata tcgccagttc 1200 
aaacaggagt tgaaggccaa gtaccccaac tctccccgac gctgctga 1248 

<210> 50 
<211> 415 
<212> PRT 

<213> Homo sapiens 
<400> 50 

Met Gin Met Ser Tyr Ala He Arg Cys Ala Phe Tyr Gin Leu Leu Leu 
1 5 10 15 

Ala Ala Leu Met Leu Val Ala Met Leu Gin Leu Leu Tyr Leu Ser Leu 
20 25 30 
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Leu Ser Gly Leu His Gly 
35 



75 

Gin Glu Glu Gin Asp Gin Tyr Phe Glu Phe 
40 45 



Phe Pro Pro Ser Pro Arg 
50 



Ser Val Asp Gin Val Lys Ala Gin Leu Arg 
55 60 



Thr Al^ Leu Ala Ser Glv 
65 70 



Gly Val Leu Asp Ala Ser Glv Asp Tyr Arg 
75 80 



Val Tyr Arg Gly Leu Leu 
85 



Lys Thr Thr Met Asp Pro Asn Asp Val lie 
90 95 



Leu Ala Thr His Ala Ser 
100 



Val Asp Asn Leu Leu His Leu Ser Gly Leu 
105 110 



Leu Glu Arg Trp Glu Gly 
115 



Pro Leu Ser Val Ser Val Phe Ala Ala Thr 
120 125 



Lys Glu Glu Ala Gin Leu 
130 



Ala Thr Val Leu Ala Tyr Ala Leu Ser Ser 
135 140 



His Cys Pro Asp Met Arg 
145 150 



Ala Arg Val Ala Met His Leu Val Cys Pro 
155 160 



Ser Arg Tyr Glu Ala Ala 
165 



Val Pro Asp Pro Arg Glu Pro Gly Glu Phe 
170 175 



Ala Leu Leu Arg Ser Cys 
180 



Gin Glu Val Phe Asp Lys Leu Ala Arg Val 
185 " 190 



Ala Gin Pro Gly lie Asn 
195 



Tyr Ala Leu Gly Thr Asn Val Ser Tyr Pro 
200 205 



Asn Asn Leu Leu Arg Asn 
210 



Leu Ala Arg Glu Gly Ala Asn Tyr Ala Leu 
215 220 



Val He Asp Val Asp Met 
225 230 



Val Pro Ser Glu Gly Leu Trp Arg Gly Leu 
235 240 



Arg Glu Met Leu Asp Gin 
245 



Ser Asn Gin Trp Gly Gly Thr Ala Leu Val 
250 255 



Val Pro Ala Phe Glu He 
260 



Arg Arg Ala Arg Arg Met Pro Met Asn Lys 
265 " 270 



Asn Glu Leu Val Gin Leu 
275 



Tyr Gin Val Gly Glu Val Arg Pro Phe Tyr 
280 285 



Tyr Gly Leu Cys Thr Pro 
290 



Cys Gin Ala Pro Thr Asn Tyr Ser Arg Trp 
295 300 



Val Asn Leu Pro Glu Glu 
305 310 



Ser Leu Leu Arg Pro Ala Tyr Val Val Pro 
315 320 



Trp Gin Asp Pro Trp Glu 
325 



Pro Phe Tyr Val Ala Gly Gly Lys Val Pro 
330 335 



Thr Phe Asp Glu Arg Phe Arg Gin Tyr Gly Phe Asn Arg He Ser Gin 
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340 

Ala Cys Glu Leu 
355 

Gly Phe Leu Val 
370 

Gin Lys Glu Ala 
385 



Lys Gin Glu Leu 



His Val Ala Gly 
360 

His Lys Gly Phe 
37 5 

Glu Asn Gin His 
390 

Lys Ala Lys Tyr 
405 



76 

345 

Phe Asp Phe Glu 



Lys Glu Ala Leu 
38 0 

Asn Lys lie Leu 
395 

Pro Asn Ser Pro 
410 



350 



Val Leu Asn Glu 
365 



Lys Phe His Pro 



Tyr Arg Gin Phe 
400 

Arg Arg Cys 
415 



<210> 51 

<211> 557 

<212> DNA 

<213> Homo sapiens 



<400> 51 

caggcaccca 

gcctacgtgg 

cccaccttcg 

gctgcatgtg 

cttcaaagaa 

cctatatcgc 

ctgagccctt 

aaggcctgac 

acactggact 

gngtggnctt 



ccaactattc 
taccttggca 
acgagcgctt 
gcggggtttg 
gcgttgaagt 
cagttcaaac 
ccctccccta 
tggggtaaga 
tggatatggg 
gagtaaa 



ccgctgggtc 
ggacccctgg 
tcggcagtac 
attttgaggt 
tccatcccca 
aggagttgaa 
atctgagaag 
aatgtcgctn 
gtgcttggga 



aacctgccgg 
gagccattct 
ggcttcaacc 
cctgaacgaa 
aaaggaggct 
ggccaagtac 
tcagcctctt 
cactttacag 
atcgattcct 



aagagagctt 
acgtggcagg 
gaatcaagcc 
ggtttcttgg 
gaaaatcagc 
cccaactctc 
ggctcctcag 
angtagcttg 
aactttacca 



gctgcggccc 60 
aggcaaggtg 120 
aggcctgcga 180 
ttcataaggg 240 
acaataagat 300 
cccgacgctg 360 
gccaccattt 420 
tggtgttgaa 480 
ctactaactt 540 
557 



<210> 52 

<211> 646 

<212> DNA 

<213> Homo sapiens 



<400> 52 

gggtgtgaac 

gttgaaggga 

gaacccttga 

agaactaaat 

cagacttctt 

ttaataataa 

gtacgaagct 

gtgaggtagg 

cgggatttac 

cccccatatt 

gacatttntt 



acgcatttat 
cagatctcag 
aggcgaagcc 
tttatgcctt 
agctccatcc 
caataataat 
tacagaatgt 
tattctcacc 
tcaaggccac 
caaggtccag 
accccagtcc 



ttacacattg 
gctgtcaccc 
ctgagagaac 
catctgattt 
cattcatact 
aacaacaata 
tttcacatat 
tcccttttta 
acagctagtt 
gggtttaaac 
ggcctaaaat 



tcatcggtag 
atggnaacct 
caggaatttt 
ctttccaaaa 
ttttgctctc 
atactgcgat 
agcatctcat 
cagacagggt 
agtgggtaaa 
accacagctt 
gggggcctga 



gcacataccc 
gcagaccctt 
aggcttctgt 
agtccatttc 
ctactaccca 
aatattaata 
ctgagcctcc 
aaccgaggct 
gctaggaatc 
ccttntggtn 
nggagc 



agcccagccc 
ccctcctcca 
tcaagagcta 
attaagtatt 
cccaagattg 
cttcacattt 
caacagttcc 
nagagaggta 
gatcccagca 
aagtgggagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

646 



<210> 53 
<211> 121 
<212> PRT 

<213> Homo sapiens 
<400> 53 

Cys Gin Ala Pro Thr Asn Tyr Ser Arg Trp Vai Asn Leu Pro Glu Glu 
1 5 10 15 
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Ser Leu Leu Arg Pro Ala Tyr Val Val Pro Trp Gin Asp Pro Trp Glu 
20 . ' 25 30 

Pro Phe Tyr Val Ala Gly Gly Lys Val Pro Thr Phe Asp Glu Arg Phe 
35 " '* 40 45 

Arg Gin Tyr Gly Phe Asn Arg lie Ser Gin Ala Cys Glu Leu His Val 
50 ' 55 60 

Ala Glv Phe Asp Phe Glu Val Leu Asn Glu Gly Phe Leu Val His Lys 
65 70 75 80 

Gly Phe Lys Glu Ala Leu Lys Phe His Pro Gin Lys Glu Ala Glu Asn 
85 " 90 95 

Gin His Asn Lys lie Leu Tyr Arg Gin Phe Lys Gin Glu Leu Lys Ala 
100 " 105 110 

Lys Tyr Pro Asn Ser Pro Arg Arg Cys 
115 120 



<210> 54 
<211> 552 
<212> DNA 
<213> Homo 

<400> 54 

cctccttcct 

gagtcctacg 

tcatgggcca 

gctttccaag 

cctacatggc 

ccccgacatg 

agagttcagc 

tcaggaggag 

tgacngaaaa 

ttacacagtt 



sapiens 



tgcccctccc 
ggacaggacc 
cctcacctcc 
ccccctcccg 
tgcctgtcat 
ctgacaccaa 
tttgtgttga 
gcagggatgc 
cgttggaccg 
nt 



tacctccctg 
ctgctccagt 
cactttcgat 
tggccgtccc 
gtccctactc 
gtggtggaaa 
atgagggggg 
gcanggagca 
naaaagttgg 



cctctacatg 
ggtatccaac 
gtctcgcctc 
ctcccaagac 
tctaagccct 
ccacccctca 
agagggacaa 
ganagtgaag 
tttntcctaa 



gcttctctcc 
tcctccctgc 
ccgtggccac 
ctctcaccca 
cctgcccact 
gccccaagcc 
gtgagggcgg 
gaaggaagat 
tttttttccg 



acccctccca 60 
ccactcttcc 120 
cctgcaatta 180 
tgtagcaatc 240 
gttcctccct 300 
tgccctgtgc 360 
agagagaagt 420 
ccgaacagat 480 
ggagaacccg 540 
552 



<210> 55 
<211> 754 
<212> DNA 
<213> Homo 

<400> 55 

ggaaagtttt 

agacacattt 

gaagtgaccc 

agcgggtgcc 

catttgcctt 

atgcacgtcg 

tgggcggcac 

nancanactn 

ggaancccnn 

nannnncnng 

ttgnnctcaa 

tcnaccntac 



sapiens 



tgttttcttt 
gttaacaaag 
cagcagtctg 
acaggctgct 
tcccctccct 
nccggcggna 
caaagcaggt 
acacctgggc 
ngggaannaa 
nnncnnncnn 
ancgaancgc 
tnccanncan 



ttcccacaca 
agagaaaaaa 
tggacaatgc 
gctcgtggaa 
cccaccccnn 
cacacagaac 
gccnntnctg 
cantncggna 
nnggnancnt 
ncannnccan 
cnccacnncc 
cncttcccaa 



tttccggggt 
aactggggga 
cttgctccct 
tctagagtat 
acccctgctc 
aggccttccg 
gtgaggggag 
nnccctntnn 
annnccnnna 



tggggtgttt 
gcagggagcc 
cttccctgct 
ttgtctgtaa 
ctccagcagc 
tcaggcctga 
ttggggcact 
cnttcnntcn 
tcncnaannn 



ncaccccang nnacgnnana 
tacagganca nanncnnaac 
ccnnntcatc cttcnttncn 



tccaagactc 60 
cgtgggcaaa 120 
gaccgcgccc 180 
tatatatctg 240 
ttccccatca 300 
gccccttccc 360 
tgccccagcc 420 
aaccnattct 480 
aanaatnanc 540 
tcgnannccc 600 
tcagngaaan 660 
ncnatccnnc 720 
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aatataannn cncttataca naactccacn ngnt 

<210> 56 
<211> 555 
<212> DNA 

<213> Homo sapiens 



754 



<400> 56 

gggcagccgg 

cggcaccggg 

cgtcctctgg 

agcgccagca 

gagacagagt 

ggtaagtatc 

ccttggggac 

caagtccgtc 

agacaaggac 

taagccaaat 

<210> 57 
<211> 611 
<212> DNA 
<213> Homo 

<400> 57 

nggcntnttn 

ggcncnnngg 

ccncttgnnt 

taaaaaaagc 

tntcaggggt 

cagganaggg 

ggccnnctgc 

gcccnnattn 

nttggttaaa 

cnccttttaa 

tgggatttgg 

<210> 58 
<211> 4425 
<212> DNA 
<213> Homo 

<400> 58 

atggggaaga 

acagacgcct 

actgagacca 

agtgcttcct 

aatgacgtac 

atgttcctca 

atggttaaga 

gggcagacag 

ttgattccac 

ttccagttag 

ggctcctaca 

gtggaggaat 

atcttggaag 

cctggacatg 

gaagattcac 



gcgagtccca 
tcagtgccct 
agaagtgcgc 
gctccgcctt 
ttgtctttcc 
tcatccatgg 
ccggatgcag 
tgggatggac 
cacattanga 
caagt 



sapiens 



ttnttagttn 
ntngngnccc 
gaaaaactgt 
nggggntttt 
aanccaaaaa 
ttaaattntn 
nagtaaanag 
gggnnnaatn 
tttnaagggn 
aanccnncct 

g 



sapiens 



acaaactcct 
cagtctctgg 
ctgagaaggg 
tggagtctgt 
tccactgtgt 
ctgtccaagt 
acgaggacag 
tgaaatttcg 
tagtatacat 
agggtggcct 
aggtggtggt 
ttgttcttcc 
aagagatgaa 
tgactgtgag 
aggctttctg 



gggcaggggt 
gccccctcct 
gcgtgagctg 
cgggagcggc 
tctgtcccat 
aagtgaatgt 
cagatgtgtt 
aaccatttgg 
acttcttctt 



cctccgccgc 
gcgggtccca 
acatggaccc 
aaaaattctt 
ctgcatctcg 
ggacacactg 
cttgccttgc 
aaggaagctg 
cctcctcctn 



cttgcacctg 
actctctctt 
aaatcctcgg 
cgaggacatt 
agtcgcagag 
gagcaagtag 
gaagatcctt 
agcctgcccg 
cgactcttca 



ccctgctggg 
tcccatcgtg 
gccgccctgg 
ttacaagcca 
accccccata 
aacttattga 
caccaacccc 
tgcctacatc 
ncaanctgga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

555 



nttnttnanc 
ncnttacttg 
tttaaaaaac 
ngnccaaccn 
ctggnggnag 
aaaaagggcc 
nctggtcttt 
cccccnccgn 
nccnagggtt 
ttccccccaa 



ccggnctttn 
annngtnang 
tncggnangg 
aantttntnt 
gnttncnccn 
ccaattcccc 
tccccaanag 
gggcacaann 
tttgncccct 
nggnnccttt 



nnccaaaagg 
ggntnnncat 
ttnaggggng 
tncctaattn 
ggaaaaantt 
ccatcnttcc 
ggnttttggc 
nttncaagcc 
ttnanaaccc 
ttntgncccc 



gnnntanggg 
ccntttaaaa 
ggaanagnnc 
gcaaatcntn 
accnttaaan 
caccttnggg 
tggcccngng 
agggcccccc 
ccctttnccc 
aanannnacc 



tcatccaagt 
aaaaccgcag 
ctgtgtcctt 
caggggaaac 
cgccttcgct 
gaaaggacca 
tctggtcttt 
tgttgtctcc 
tcaggatccc 
caagcaattt 
acagaagaaa 
caagtttgaa 
tgtatcagtg 
catttgcaga 
tgagaaattc 



ctggttcttc 
tatatggttc 
ctgagctacc 
aggagcctct 
gtcccaaagt 
acccaagaat 
gtccagacag 
atggatgaaa 
aaaggaaatc 
tcttttcccc 
tcaggtggaa 
gtacaagtaa 
tgtggcctat 
aagtatagtg 
agtggacagc 



tcctcttggt 
tggtcccctc 
tgaatgagac 
tcactgacct 
cttcatccaa 
ttaagaagcg 
acaaatcaat 
actttcaccc 
gcatcgcaca 
tctcatcaga 
ggacagagca 
cagtgccaaa 
acacatatgg 
acgcttccga 
taaacagcca 



cctcctgccc 
cctgctccac 
agtgactgta 
ggaggcggag 
tgaggaggta 
gaccacagtg 
ctacaaacca 
cctgaatgag 
atggcagagt 
gcccttccag 
ccctttcacc 
gataatcacc 
gaagcctgtc 
ctgccacggt 
tggctgcttc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

611 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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tatcagcaag 
cacactgagg 
agtgaaatca 
cagggaattc 
aataaagtca 
gagcatggcc 
gttagggtca 
cacgaagagg 
cttgagccca 
attctgaatg 
gcaaagggag 
aagggccatt 
ctcatctatg 
gaaaattgtc 
tcacacgccc 
gaccaaagcg 
ctgctaccag 
gactgcatca 
acaaatgaaa 
tcaaagattc 
gaaggtctac 
gtgcatgttg 
tgggatttgg 
accatcaccg 
tcttccactg 
tctgtgattc 
tgcatccggg 
aaggaacaag 
accccaaagt 
gagctgtgtg 
aagcctctgt 
tgtccatcag 
gaagaatctg 
aacacacaaa 
gctcctaaca 
aagtccaagg 
tatgatggct 
ctcacagcct 
gcacacatta 
aggagctctg 
ctctccgcct 
gtccgcaatg 
ggcagccatg 
gacaagagga 
gtccattggg 
gctccctctg 
ccagccccaa 
cagcagaatg 
ctgtccaaat 
cagtcttcag 
cagcaggtct 
tgtgtctacc 
tttgctttag 
ttccaaatct 
gttgatgtga 
agatctaacc 
aaggtgtcaa 
gatctcaaac 



taaaaaccaa 
cccagatcca 
caagaaccat 
ccttctttgg 
tattcatcag 
ttgtacagtt 
attacaagga 
cacatcacac 
tgtctcatga 
gaggcaccct 
gcattgtccg 
tttccatctc 
ctgttttacc 
tggccaacaa 
acctgcgagt 
tgctgctcat 
aaaaggacct 
atcgtcataa 
aggatatgta 
gtaaacccaa 
gtgtaggttt 
aagagcctca 
tggtggtaaa 
agtggaaggc 
cctctctccg 
gtggagaggc 
tcagtgtgca 
cgcctcactg 
cattaggaaa 
ggactgaggt 
tggttgaacc 
gtggtgaggt 
cccgagcttc 
atcttctcca 
tctatgtact 
ccattggcta 
cctacagcac 
ttgttctgaa 
cccaagccct 
ggtcactgct 
atatcaccat 
ccctgttttg 
tatataccaa 
aggaagtact 
agcgccctca 
ctgaggtgga 
cctcggagga 
cccagggcgg 
atggagccgc 
ggacattttc 
cattgccaga 
tccagacctc 
gagtgcagac 
ccctaagtgt 
agatggtctc 
atgtgagccg 
atcagacact 
cagccatagt 



ggtcttccag 
agaagaagga 
aaccaaactc 
gcaggtgcgc 
aggaaatgaa 
ctctatcaac 
tcgtagtccc 
tgcttatctt 
actaccctgt 
gctggggctg 
aactgggact 
aatccctgtg 
taccggggac 
ggtggatttg 
cacagcggct 
gaagcctgat 
cactggcttc 
tgtctatatt 
cagcttccta 
aatgtgtcca 
ttatgagtca 
cacggagacc 
ctcagcaggg 
aggggccttc 
agccttccag 
cttcacactc 
gctggaagcc 
catctgtgca 
tgtgaatttc 
gccttcagtt 
tgaaggacta 
ttctgaagaa 
tgtctcagtt 
gatgccctat 
ggattatcta 
tctcaacact 
ctttggggag 
gacttttgcc 
catatggctc 
caacaatgcc 
cgcccttctg 
cctggagtca 
agcactgctg 
caagtcactt 
gaaacccaag 
gatgacatcc 
cctgacctct 
tttctcctcc 
cacatttacc 
cagcaaattc 
gctgcctggg 
cttgaaatac 
tctgcctcaa 
cagttacaca 
tggcttcatt 
gacagaagtc 
gagcttgttc 
gaaagtctat 
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ctgaagagga 
acagtggtgg 
tcatttgtga 
ctagtagatg 
gcaaactatt 
accaccaacg 
tgttacggct 
gtgttctccc 
ggccatactc 
aagaagctct 
catggactgc 
aagtcagaca 
gtgattgggg 
agcttcagcc 
cctcagtccg 
gctgagctct 
cctgggcctt 
aatggaatca 
gaggacatgg 
cagcttcaac 
gatgtaatgg 
gtacgaaagt 
gtggctgagg 
tgcctgtctg 
cccttctttg 
aaggccacgg 
tctcccgcct 
aacgggcggc 
actgtgagcg 
cctgaacacg 
gagaaggaaa 
ttatccctga 
ttgggagaca 
ggctgtggag 
aatgaaacac 
ggttaccaga 
cgatatggca 
caagctcgag 
tcccagaggc 
ataaagggag 
gagattcctc 
gcctggaaga 
gcctatgctt 
aatgaggaag 
gcaccagtgg 
tatgtgctcc 
gcaaccaaca 
acccaggaca 
aggactggga 
caagtggaca 
gaatacagca 
aatattctcc 
acttgtgatg 
gggagccgct 
cccctgaagc 
agcagcaacc 
ttcacggttc 
gattactacg 



aggagtatga 
aattgactgg 
aagtggactc 
ggaaaggcgt 
actccaatgc 
ttatgggtac 
accagtgggt 
caagcaagag 
agacagtcca 
ccttttatta 
ttgtgaagca 
ttgctcctgt 
attctgcaaa 
catcacaaag 
tctgcgccct 
cggcgtcctc 
tgaatgacca 
catatactcc 
gcttaaaggc 
agtatgaaat 
gaagaggcca 
acttccctga 
taggagtaac 
aagatgctgg 
tggagcttac 
tcctaaacta 
tccttgctgt 
aaactgtgtc 
cagaggcact 
gaaggaaaga 
caacattcaa 
aactgccacc 
tattaggctc 
agcagaatat 
agcagcttac 
gacagttgaa 
ggaaccaggg 
cctacatctt 
agaaggacaa 
gagtagaaga 
tcacagtcac 
cagcacaaga 
ttgccctggc 
ctgtgaagaa 
ggcattttta 
tcgcttatct 
tcgtgaagtg 
cagtggtggc 
aggctgcaca 
acaacaatcg 
tgaaagtgac 
cagaaaagga 
aacccaaagc 
ctgcctccaa 
caacagtgaa 
atgtcttgat 
tgcaagatgt 
agacggatga 



aatgaaactt 
aaggcagtcc 
acactttcga 
ccctatacca 
taccacggat 
ctctcttact 
gtcagaagaa 
ctttgtccac 
ggcacattat 
tctgataatg 
ggaagacatg 
cgctcggttg 
atatgatgtt 
tctcccagcc 
ccgtgctgtg 
ggtttacaac 
ggacgatgaa 
agtatcaagt 
attcaccaac 
gcatggacct 
tgcacgcctg 
gacatggatc 
agtccctgac 
acttggtatc 
aatgccttac 
ccttcccaaa 
cccagtggag 
ctgggcagta 
agagtctcaa 
cacagtcatc 
ctccctactt 
aaatgtggta 
tgccatgcaa 
ggtcctcttt 
tccagaggtc 
ctacaaacac 
caacacctgg 
catcgatgaa 
tggctgtttc 
tgaagtgacc 
tcaccctgtt 
aggggaccat 
aggtaaccag 
agacaactct 
cgaaccccag 
cacggcccag 
gatcacgaag 
tctccatgct 
ggtgactatc 
cctgttactg 
aggagaagga 
agagttcccc 
ccacaccagc 
catggcgatc 
aatgcttgaa 
ttaccttgat 
cccagtaaga 
gtttgcaatc 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 
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gctgagtaca atgctccttg cagcaaagat cttggaaatg cttga 4425 

<210> 59 
<211> 1474 
<212> PRT 

<213> Homo sapiens 
<400> 59 

Met Gly Lys Asn Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu 
1 5 10 15 

Val Leu Leu Pro Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met 
20 25 30 

Val Leu Val Pro Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys 
35 40 45 

Val Leu Leu Ser Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu 
50 55 60 

Glu Ser Val Arg Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu 
65 70 75 80 

Asn Asp Val Leu His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser 
85 90 95 

Asn Glu Glu Val Met Phe Leu Thr Val Gin Val Lys Gly Pro Thr Gin 
100 105 110 

Glu Phe Lys Lys Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu 
115 120 125 

Val Phe Val Gin Thr Asp Lys Ser lie Tyr Lys Pro Gly Gin Thr Val 
130 135 140 

Lys Phe Arg Val Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu 
145 150 155 160 

Leu He Pro Leu Val Tyr He Gin Asp Pro Lys Gly Asn Arg He Ala 
165 170 175 

Gin Trp Gin Ser Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe 
180 185 190 

Pro Leu Ser Ser Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin 
195 200 205 

Lys Lys Ser Gly Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe 
210 215 220 

Val Leu Pro Lys Phe Glu Val Gin Val Thr Val Pro Lys He He Thr 
225 230 235 240 

He Leu Glu Glu Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr 
245 250 255 

Gly Lys Pro Val Pro Gly His Val Thr Val Ser lie Cys Arg Lys Tyr 
260 265 270 
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Ser Asp Ala Ser Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu 
275 280 285 

Lys Phe Ser Gly Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val 
290 295 300 

Lys Thr Lys Val Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu 
305 310 315 320 

His Thr Glu Ala Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr 
325 330 335 

Gly Arg Gin Ser Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe 
340 345 350 

Val Lys Val Asp Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin 
355 360 365 

Val Arg Leu Val Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He 
370 375 380 

Phe lie Arg Gly Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp 
385 390 395 400 

Glu His Gly Leu Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly 
405 410 415 

Thr Ser Leu Thr Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr 
420 425 430 

Gly Tyr Gin Trp Val Ser Glu Glu His Glu Glu Ala His His Thr Ala 
435 440 445 

Tyr Leu Val Phe Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met 
450 455 460 

Ser His Glu Leu Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr 
465 470 475 480 

lie Leu Asn Gly Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr 
485 490 495 

Tyr Leu lie Met Ala Lys Gly Gly He Val Arg Thr Gly Thr His Gly 
500 505 510 

Leu Leu Val Lys Gin Glu Asp Met Lys Gly His Phe Ser lie Ser He 
515 520 525 

Pro Val Lys Ser Asp lie Ala Pro Val Ala Arg Leu Leu lie Tyr Ala 
530 535 540 

Val Leu Pro Thr Gly Asp Val lie Gly Asp Ser Ala Lys Tyr Asp Val 
545 550 555 560 

Glu Asn Cys Leu Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin 
565 ~ 570 575 
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Ser Leu Pro Ala Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gin 
580 585 590 

Ser Val Cys Ala Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys 
595 600 605 

Pro Asp Ala Glu Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu 
610 615 620 

Lys Asp Leu Thr Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu 
625 630 635 640 

Asp Cys He Asn Arg His Asn Val Tyr He Asn Gly He Thr Tyr Thr 
645 650 655 

Pro Val Ser Ser Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp 
660 665 670 

Met Gly Leu Lys Ala Phe Thr Asn Ser Lys He Arg Lys Pro Lys Met 
675 680 685 

Cys Pro Gin Leu Gin Gin Tyr Glu Met His Gly Pro Glu Gly Leu Arg 
690 695 700 

Val Gly Phe Tyr Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu 
705 710 715 720 

Val His Val Glu Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro 
725 730 735 

Glu Thr Trp He Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala 
740 745 750 

Glu Val Gly Val Thr Val Pro Asp Thr He Thr Glu Trp Lys Ala Gly 
755 760 765 

Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly He Ser Ser Thr Ala 
770 775 780 

Ser Leu Arg Ala Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr 
785 790 795 800 

Ser Val He Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn 
805 810 815 

Tyr Leu Pro Lys Cys He Arg Val Ser Val Gin Leu Glu Ala Ser Pro 
820 825 830 

Ala Phe Leu Ala Val Pro Val Glu Lys Glu Gin Ala Pro His Cys He 
835 840 845 

Cys Ala Asn Gly Arg Gin Thr Val Ser Trp Ala Val Thr Pro Lys Ser 
850 855 860 

Leu Gly Asn Val Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin 
865 870 875 880 

Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys 
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885 



83 
890 



895 



Asp Thr Val He Lys Pro Leu Leu Val Glu Pro Glu Gly Leu Glu Lys 
900 905 910 

Glu Thr Thr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu Val Ser 
qi.S 920 925 

Glu Glu Leu Ser Leu Lys Leu Pro Pro Asn Val Val Glu Glu Ser Ala 
930 " 935 940 

Arg Ala Ser Val Ser Val Leu Gly Asp He Leu Gly Ser Ala Met Gin 
945 950 955 960 

Asn Thr Gin Asn Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn 
965 970 975 

Met Val Leu Phe Ala Pro Asn He Tyr Val Leu Asp Tyr Leu Asn Glu 
980 985 990 

Thr Gin Gin Leu Thr Pro Glu Val Lys Ser Lys Ala He Gly Tyr Leu 
995 1000 1005 

Asn Thr Gly Tyr Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp Gly- Ser 
1010 1015 1020 

Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp 
1025 1030 1035 1040 

Leu Thr Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr He 
1045 1050 1055 

Phe He Asp Glu Ala His He Thr Gin Ala Leu He Trp Leu Ser Gin 
1060 1065 1070 

Arg Gin Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn 
1075 * 1080 1085 

Asn Ala He Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr 
1090 ' 1095 HOO 

He Thr He Ala Leu Leu Glu He Pro Leu Thr Val Thr His Pro Val 
1105 1110 1H5 H20 

Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin 
1125 H30 H35 

Glu Giv Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr 
1140 ~ 1145 H50 

Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys 
1155 1160 H65 

Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu 
1170 1175 H80 

Arg Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro Gin 



1185 



1190 



1195 



1200 
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Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr 
1205 1210 1215 

Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr 
1220 1225 1230 

Asn He Val Lys Trp He Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe 
1235 1240 1245 

Ser Ser Thr Gin Asp Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr 
1250 1255 1260 

Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He 
1265 1270 1275 1280 

Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn 
1285 1290 1295 

Arg Leu Leu Leu Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr 
1300 1305 1310 

Ser Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu 
1315 1320 1325 

Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly 
1330 1335 1340 

Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser 
1345 1350 1355 1360 

Phe Gin lie Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser 
1365 1370 1375 

Asn Met Ala He Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu 
1380 1385 1390 

Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr 
1395 " 1400 1405 

Glu Val Ser Ser Asn His Val Leu lie Tyr Leu Asp Lys Val Ser Asn 
1410 1415 1420 

Gin Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg 
1425 1430 1435 1440 

Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp 
1445 1450 1455 

Glu Phe Ala lie Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly 
1460 1465 1470 

Asn Ala 



<210> 60 
<211> 722 
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<212> DNA 

<213> Homo sapiens 



<400> 60 

ngaccnnaaa 

ggagctntga 

ccaaqatctt 

naancataga 

gngaanaaca 

ctgntgactt 

cttcagggga 

aaaccggntc 

tttgggttca 

tnttccnttt 

ttctcccggn 

aaanncgggg 

ac 



aagngtttat 
naacaggact 
tgctgcaagg 
ctttcactat 
agctcagggn 
ctgncccggn 
atgaaacccc 
cctgngtaac 
tnnnnaaggt 
ttntggnnaa 
nntttcaang 
ggttttttac 



tcatcaagtn 
ccagcaaagc 
agcattgnnc 
ggctggtttc 
ctgatttgac 
tcacatgggt 
gagaccntnt 
ctnnccctta 
ttngaggcna 
antttgnttt 
cnggntttcc 
ncccttggaa 



85 



tttaaagatn 
acttttcagc 
tcagcaattg 
ananctntta 
accttntcaa 
tagatctttc 
tnncaatnaa 
gggganattt 
agnnctgnct 
ttcaaggnat 
cagggnagtt 
ttntnnggga 



caaaaacacg 
cttggggnct 
caaactcatc 
ctgggacatt 
ggtaaatcaa 
aagcnttttt 
cgacncccnt 
ggaaanctng 
tcnnaaagca 
tnnngaagnt 
ttggnatagn 
aaaanncctn 



tgtnttntgn 60 
tcaagcattt 120 
cttntcgtag 180 
tcgcanaacc 240 
gacatggttg 300 
nactgnnngg 360 
nttgggaggc 420 
gtgtgggncn 480 
aaggggnacc 540 
annnncaacc 600 
nccnnttnna 660 
aaannnggga 720 
722 



<210> 61 
<211> 557 
<212> DNA 

<213> Homo sapiens 



<400> 61 

gtgtgcagtg 

acatcactcc 

gcctcgacac 

aaggggcagt 

tccacctttc 

cctactttgt 

taaaaaaaaa 

gcangttcac 

tttgcctttc 

gatctataaa 

<210> 62 
<211> 640 
<212> DNA 
<213> Homo 

<400> 62 

ttaaaggttg 

aggtcccaga 

ggagggggct 

cttcctgccc 

ggcagctgga 

tgtgaaatcc 

cacgtcgacg 

gctagcgtaa 

gtggggtatc 

catccctgta 

nggtttgaat 



cccccttgcc 
acacagaaac 
agcactgtgg 
gtcctgaagc 
cactcaacaa 
tttttgttat 
aaaaaaaaaa 
ttcaactgtg 
tttatgtaac 
attttta 



sapiens 



ggagatattt 
ctccggacca 
acacatcacc 
ttggaaatgg 
agaaggcaaa 
acaaatggca 
cggccgcgaa 
tctggaacat 
ttcatcatcg 
gtgattccaa 
tattaccatc 



tcttcaaacc 
attacagcct 
cctgtcccta 
ccatcttttc 
cctcccaccc 
ttatgacctt 
aactcgagag 
catcgtgcan 
tatactcctc 



tacagcttct 
ggcatcccca 
ttgcccaggc 
tgtgactgtc 
ctgtcctgct 
gtttaaagaa 
atctatgaat 
catctcaatt 
taaagtttca 



ctttgccatt 
gtctttgcct 
acgccatttc 
ttaggtgatg 
gcatggtccg 
aataaatatc 
cgaagatact 
ctttcatttn 
atcttgggca 



tgtggatttc 60 
tcttccagct 120 
caagggcagg 180 
tgtagccccc 240 
gagtctggga 300 
tcccaacctt 360 
gaaaaacccc 420 
atacatccnt 480 
ttnaaccttt 540 
557 



attttcttta 
tgcagcagga 
taagacagtc 
cgtgcctggg 
gactggggat 
aagagaagct 
ttcggatccc 
cgtatgggta 
aatagatagt 
acgcgttata 
tcaattttac 



aacaaggtca 
caggggtggg 
acagaaaaga 
caatagggac 
gccaggctgt 
gtaggtttga 
cggggcctcc 
aagccataga 
tatatacata 
cgcagtttgg 
tagccgggat 



taaataacaa 
aggttgttga 
tgggcttcag 
aggccacagt 
aatgtttctg 
agaggcaagg 
atggccatat 
gatctctttt 
tccattgtag 
tccgtccaac 



aaaacaaagt 
gtggaaaggt 
gacactgccc 
gctgtgtcga 
tgtggagtga 
gggcactgca 
gaccacccaa 
tttgggtttg 
tgggattaaa 
caggtgacag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

640 



<210> 63 
<211> 566 
<212> DNA 

<213> Homo sapiens 
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<400> 63 

ctcccccatc 

tcccctcttt 

cccctccccc 

tggccccatc 

ttcccgggct 

acggcccgcc 

cggnggggtg 

aagctacgct 

ccagncatca 

taccaanatg 



cgcatcctcc 
gcctctcccc 
gtctctgccc 
ccgtccccgc 
tggcgtccct 
tgcctccctt 
gtgtccccgg 
gggaggaggc 
tggccttcan 
ggtggccnta 



tcctcctgtc 
atccctcgct 
agccagcccc 
cctggcccct 
tctcccccct 
gcccaagtcc 
actcttctct 
cgacgcagcn 
tggcggtnca 
tnncta 



86 

ccctcttcct 
ggcctgctcc 
ctcccgactc 
ttgtgcccct 
aactccttcc 
tgagccacca 
ntncaagaac 
ttgcctnagc 
tnngnttcac 



ctctgtccct 
tgctcccctt 
cccagtttca 
tctcatcgtt 
ctcggcctcc 
tgctgacccc 
acgcttcagc 
caggcctggt 
ctnctggnag 



caccaagcca 
ctcccatcct 
tcggactccc 
ttctccctcc 
ctgccccttc 
gatggtggcc 
cggctgcccc 
ggtcctttgn 
cacntattga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

566 



<210> 64 

<211> 648 

<212> DNA 

<213> Homo sapiens 



<400> 64 

tggcgttcac 

accccagatt 

ggcgcgatgt 

gcactgggaa 

aatttgggta 

ccgtggttgc 

ctctcctctc 

ggcagaggcc 

ccttcccaca 

cttcancgca 

tcanccccnt 



gcagatcgtc 
ggaggaagcc 
gcagccgatg 
ggcttggggg 
aataaataaa 
agccggccac 
gggtgatggg 
gtctncccac 
aagcccttct 
aagggtntgc 
tngtccccna 



ttttattagc 
ttgagaggtc 
gtgagggact 
tagcagccac 
ataaataaga 
tcgggaggcc 
gagccctggg 
attctgactc 
tgccctcagt 
aggaaagggg 
aaagatgggg 



ggtctgtaaa 
agtagcacca 
gggcgccctc 
ctccttcccc 
ttcctcaagc 
cgagggccag 
ggatggcagc 
cttggtcccc 
gggtggggaa 
caaaattagg 
ccttttccnt 



gcacctccca 
gggacatggc 
gcctgccccc 
caacccaaca 
tggcctaccc 
cgggggttag 
ataggggctg 
cttgaaaccc 
ggccggtgcc 
gggnaaaaag 
ttngnggt 



gggtcccccg 60 
agggcccgag 120 
ggggttgtca 180 
gactagttca 240 
tggagaggag 300 
ttggggcgtc 360 
ggatggcctt 420 
tgttggtgtc 480 
cccttccctt 540 
gtcccttttt 600 
648 



<210> 65 
<211> 2274 
<212> PRT 
<213> Mus sp. 

<400> 65 

Met Thr Ser Ser Met Ala Ser Tyr Glu Gin Leu Val Arg Gin Val Glu 
15 10 15 



Ala Leu 



Ser Ser 



Val Leu 
50 

Val Ser 
65 



Lys Ala 
20 

His Leu 
35 

Lys His 
Ser Gly 
lie Ser 



Glu Asn Thr His 



Ser Lys Leu Glu 
40 

Leu Gin Gly Lys 
55 

Gin Thr Glu Val 
70 



Leu Arg Gin Glu Leu Arg Asp Asn 
25 30 

Thr Glu Thr Ser Gly Met Lys Glu 

45 

Leu Glu Gin Glu Ala Arg Val Leu 
60 



Leu Glu Gin Leu Lys Ala 
75 



Glu Pro 
100 



Ser Leu Tyr Asn 
85 

Ala Ala Arg Thr 



Leu Lys Phe His Ala Pro 
90 



Thr Asp 
Gly Pro 

Ser Gly Pro Ser Lys Asp Ser Phe Gly Glu Leu Ser Arg Ala 



Pro Glu Gly Ser Pro Val 
105 110 



Leu Gin 
80 

Ala Leu 
95 

His Gly 
Thr He 
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115 120 125 

Arg Leu Leu Glu Glu Leu Asp Gin Glu Arg Cys Phe Leu Leu Ser Glu 
130 135 140 

He Glu Lys Glu Glu Lys Glu Lys Leu Trp Tyr Tyr Ser Gin Leu Gin 
145 150 1 160 

Gly Leu Ser Lys Arg Leu Asp Glu Leu Pro His Val Asp Thr Phe Ser 
165 170 175 

Met Gin Met Asp Leu He Arg Gin Gin Leu Glu Phe Glu Ala Gin His 
180 185 190 

He Arg Ser Leu Met Glu Glu Arg Phe Gly Thr Ser Asp Glu Met Val 
195 200 205 

Gin Arg Ala Gin He Arg Ala Ser Arg Leu Glu Gin He Asp Lys Glu 
210 215 220 

Leu Leu Glu Ala Gin Asp Arg Val Gin Gin Thr Glu Pro Gin Ala Leu 
225 230 235 240 

Leu Ala Val Lys Pro Val Ala Val Glu Glu Glu Gin Glu Ala Glu Val 
245 250 255 

Pro Thr His Pro Glu Asp Gly Thr Pro Gin Pro Gly Asn Ser Lys Val 
260 265 270 

Glu Val Val Phe Trp Leu Leu Ser Met Leu Ala Thr Arg Asp Gin Glu 
275 280 285 

Asp Thr Ala Arg Thr Leu Leu Ala Met Ser Ser Ser Pro Glu Ser Cys 
290 295 300 

Val Ala Met Arg Arg Ser Gly Cys Leu Pro Leu Leu Leu Gin He Leu 
305 310 " 315 320 

His Gly Thr Glu Ala Gly Ser Val Gly Arg Ala Gly He Pro Gly Ala 
325 330 335 

Pro Gly Ala Lys Asp Ala Arg Met Arg Ala Asn Ala Ala Leu His Asn 
340 " 345 350 

He Val Phe Ser Gin Pro Asp Gin Gly Leu Ala Arg Lys Glu Met Arg 
355 360 365 

Val Leu His Val Leu Glu Gin He Arg Ala Tyr Cys Glu Thr Cys Trp 
370 375 380 

Asp Trp Leu Gin Ala Arg Asp Ser Gly Thr Glu Thr Pro Val Pro He 
385 390 395 400 

Glu Pro Gin He Cys Gin Ala Thr Cys Ala Val Met Lys Leu Ser Phe 
405 410 415 

Asp Glu Glu Tyr Arg Arg Ala Met Asn Glu Leu Gly Gly Leu Gin Ala 
420 425 430 
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Val Ala Glu Leu Leu Gin Val Asp Tyr Glu Met His Lys Met Thr Arg 
435 440 445 

Asp Pro Leu Asn Leu Ala Leu Arg Arg Tyr Ala Gly Met Thr Leu Thr 
450 455 4 60 

Asn Leu Thr Phe Gly Asp Val Ala Asn Lys Ala Thr Leu Cys Ala Arg 
465 4 ™ 47 5 4 80 

Arg Gly Cys Met Glu Ala He Val Ala Gin Leu Gly Ser Glu Ser Glu 

Glu Leu His Gin Val Val Ser Ser He Leu Arg Asn Leu Ser Trp Arg 
500 505 510 

Ala Asp lie Asn Ser Lys Lys Val Leu Arg Glu Val Gly Ser Met Thr 
515 520 525 

Ala Leu Met Glu Cys Val Leu Arg Ala Ser Lys Glu Ser Thr Leu Lys 
530 535 540 

Ser Val Leu Ser Ala Leu Trp Asn Leu Ser Ala His Ser Thr Glu Asn 
545 550 555 560 

Lys Ala Ala He Cys Gin Val Asp Gly Ala Leu Gly Phe Leu Val Ser 
565 570 575 

Thr Leu Thr Tyr Arg Cys Gin Gly Asn Ser Leu Ala Val lie Glu Ser 
580 585 590 

Gly Gly Gly He Leu Arg Asn Val Ser Ser Leu He Ala Thr Arg Glu 
595 600 605 

Asp Tyr Arg Gin Val Leu Arg Asp His Asn Cys Leu Gin Thr Leu Leu 
610 " 615 620 

Gin His Leu Thr Ser His Ser Leu Thr He Val Ser Asn Ala Cys Gly 
625 630 635 b4U 

Thr Leu Trp Asn Leu Ser Ala Arg Ser Pro Arg Asp Gin Glu Leu Leu 
645 650 655 

Trp Asp Leu Gly Ala Val Gly Met Leu Arg Asn Leu Val His Ser Lys 
660 665 670 

His Lys Met He Ala Met Gly Ser Ala Ala Ala Leu Arg Asn Leu Leu 
675 680 685 

Ala His Arg Pro Ala Lys Tyr Gin Ala Ala Ala Met Ala Val Ser Pro 
690 " 695 700 

Gly Thr Cys Val Pro Ser Leu Tyr Val Arg Lys Gin Arg Ala Leu Glu 
705 ' 710 715 "V 

Ala Glu Leu Asp Thr Arg His Leu Val His Ala Leu Gly His Leu Glu 
725 730 735 
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Lys Gin Ser Leu Pro Glu Ala Glu Thr Thr Ser Lys Lys Pro Leu Pro 
740 745 750 

Pro Leu Arg His Leu Asp Gly Leu Val Gin Asp Tyr Ala Ser Asp Ser 
755 760 765 

Gly Cvs Phe Asp Asp Asp Asp Ala Pro Ser Leu Ala Ala Ala Ala Thr 
770 775 780 

Thr Ala Glu Pro Ala Ser Pro Ala Val Met Ser Met Phe Leu Gly Gly 
785 790 795 800 

Pro Phe Leu Gin Gly Gin Ala Leu Ala Arg Thr Pro Pro Ala Arg Gin 
805 810 815 

Gly Gly Leu Glu Ala Glu Lys Glu Ala Gly Gly Glu Ala Ala Val Ala 
820 825 830 

Ala Lys Ala Lys Ala Lys Leu Ala Leu Ala Val Ala Arg He Asp Arg 
835 840 845 

Leu Val Glu Asp He Ser Ala Leu His Thr Ser Ser Asp Asp Ser Phe 
850 855 860 

Ser Leu Ser Ser Gly Asp Pro Gly Gin Glu Ala Pro Arg Glu Gly Arg 
865 870 875 880 

Ala Gin Ser Cys Ser Pro Cys Arg Gly Thr Glu Gly Gly Arg Arg Glu 
885 890 895 

Ala Gly Ser Arg Ala His Pro Leu Leu Arg Leu Lys Ala Ala His Thr 
900 905 910 

Ser Leu Ser Asn Asp Ser Leu Asn Ser Gly Ser Thr Ser Asp Gly Tyr 
915 920 925 

Cys Thr Arg Glu His Met Thr Pro Cys Pro Leu Ala Ala Leu Ala Glu 
930 " 935 940 

His Arg Asp Asp Pro Val Arg Gly Gin Thr Arg Pro Arg Arg Leu Asp 
945 950 955 960 

Leu Asp Leu Pro Ser Arg Ala Glu Leu Pro Ala Arg Asp Thr Ala Ala 
965 970 975 

Thr Asp Ala Arg Val Arg Thr He Lys Leu Ser Pro Thr Tyr Gin His 
980 985 990 

Val Pro Leu Leu Asp Gly Ala Ala Gly Ala Gly Val Arg Pro Leu Val 
995 1000 1005 

Gly Pro Gly Thr Ser Pro Gly Ala Arg Lys Gin Ala Trp He Pro Ala 
1010 1015 1020 

Asp Ser Leu Ser Lys Val Pro Glu Lys Leu Val Ala Ser Pro Leu Pro 
1025 1030 1035 1040 

He Ala Ser Lys Val Leu Gin Lys Leu Val Ala Gin Asp Gly Pro Met 
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1045 



90 

1050 



1055 



Ser Leu Ser Arg Cys Ser Ser Leu Ser Ser Leu Ser Ser Thr Gly His 
1060 1065 1070 

Ala Val Pro Ser Gin Ala Glu Asn Leu Asp Ser Asp Ser Ser Leu Glu 
1075 1080 1085 

Gly Leu Glu Glu Ala Gly Pro Gly Glu Ala Glu Leu Gly Arg Ala Trp 
1090 1095 1100 

Arg Ala Ser Gly Ser Thr Ser Leu Pro Val Ser He Pro Ala Pro Gin 
1105 1110 1H5 1120 

Arg Gly Arg Ser Arg Gly Leu Gly Val Glu Asp Ala Thr Pro Ser Ser 
1125 H30 1135 

Ser Ser Glu Asn Cys Val Gin Glu Thr Pro Leu Val Leu Ser Arg Cys 
1140 1145 H50 

Ser Ser Val Ser Ser Leu Gly Ser Phe Glu Ser Arg Ser He Ala Ser 
1155 1160 H65 

Ser He Pro Ser Asp Pro Cys Ser Gly Leu Gly Ser Gly Thr Val Ser 
1170 1175 1180 

Pro Ser Glu Leu Pro Asp Ser Pro Gly Gin Thr Met Pro Pro Ser Arg 
1185 1190 1195 1200 

Ser Lys Thr Pro Pro Ala Pro Pro Gly Gin Pro Glu Thr Ser Gin Phe 
1205 1210 1215 

Ser Leu Gin Trp Glu Ser Tyr Val Lys Arg Phe Leu Asp He Ala Asp 
1220 " 1225 1230 

Cys Arg Glu Arg Cys Gin Pro Pro Ser Glu Leu Asp Ala Gly Ser Val 
1235 1240 1245 

Arg Phe Thr Val Glu Lys Pro Asp Glu Asn Phe Ser Cys Ala Ser Ser 
1250 1255 1260 

Leu Ser Ala Leu Ala Leu His Glu Leu Tyr Val Gin Gin Asp Val Glu 
1265 1270 1275 1280 

Leu Arg Leu Arg Pro Pro Ala Cys Pro Glu Arg Ala Val Gly Gly Gly 
1285 1290 1295 

Gly His Arg Arg Arg Asp Glu Ala Ala Ser Arg Leu Asp Gly Pro Ala 
1300 1305 1310 

Pro Ala Gly Ser Arg Ala Arg Ser Ala Thr Asp Lys Glu Leu Glu Ala 
1315 1320 1325 

Leu Arg Glu Cys Leu Gly Ala Ala Met Pro Ala Arg Leu Arg Lys Val 
1330 1335 1340 

Ala Ser Ala Leu Val Pro Gly Arg Arg Ser Leu Pro Val Pro Val Tyr 



1345 



1350 



1355 



1360 
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Met Leu Val Pro Ala Pro Ala Arg Gly Asp Asp Ser Gly Thr Asp Ser 
1365 1370 1375 

Ala Glu Gly Thr Pro Val Asn Phe Ser Ser Ala Ala Ser Leu Ser Asp 
1380 1385 1390 

Glu Thr Leu Gin Gly Pro Ser Arg Asp Lys Pro Ala Gly Pro Gly Asp 
1395 1400 1405 

Arg Gin Lys Pro Thr Gly Arg Ala Ala Pro Ala Arg Gin Thr Arg Ser 
1410 1415 1420 

His Arg Pro Lys Ala Ala Gly Ala Gly Lys Ser Thr Glu His Thr Arg 
1425 1430 1435 1440 

Gly Pro Cys Arg Asn Arg Ala Gly Leu Glu Leu Pro Leu Ser Arg Pro 
1445 1450 1455 

Gin Ser Ala Arg Ser Asn Arg Asp Ser Ser Cys Gin Thr Arg Thr Arg 
1460 1465 1470 

Gly Asp Gly Ala Leu Gin Ser Leu Cys Leu Thr Thr Pro Thr Glu Glu 
1475 1480 1485 

Ala Val Tyr Cys Phe Tyr Asp Ser Asp Glu Glu Pro Pro Ala Thr Ala 
1490 1495 1500 

Pro Pro Pro Arg Arg Ala Ser Ala lie Pro Arg Ala Leu Lys Arg Glu 
1505 1510 1515 1520 

Lys Pro Ala Gly Arg Lys Glu Thr Pro Ser Arg Ala Ala Gin Pro Ala 
1525 ~ 1530 1535 

Thr Leu Pro Val Arg Ala Gin Pro Arg Leu lie Val Asp Glu Thr Pro 
1540 1545 1550 

Pro Cys Tyr Ser Leu Thr Ser Ser Ala Ser Ser Leu Ser Glu Pro Glu 
1555 1560 1565 

Ala Pro Glu Gin Pro Ala Asn His Ala Arg Gly Pro Glu Gin Gly Ser 
1570 1575 1580 

Lys Gin Asp Ser Ser Pro Ser Pro Arg Ala Glu Glu Glu Leu Leu Gin 
1585 1590 1595 1600 

Arg Cys lie Ser Leu Ala Met Pro Arg Arg Arg Thr Gin Val Pro Gly 
1605 1610 1615 

Ser Arg Arg Arg Lys Pro Arg Ala Leu Arg Ser Asp lie Arg Pro Thr 
1620 1625 1630 

Glu lie Thr Gin Lys Cys Gin Glu Glu Val Ala Gly Ser Asp Pro Ala 
1635 1640 1645 

Ser Asp Leu Asp Ser Val Glu Trp Gin Ala lie Gin Glu Gly Ala Asn 
1650 1655 1660 
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Ser He Val Thr Trp Leu His Gin Ala Ala Ala Lys Ala Ser Leu Glu 
1665 1670 1675 1680 

Ala Ser Ser Glu Ser Asp Ser Leu Leu Ser Leu Val Ser Gly Val Ser 
1685 1690 1695 

Ala Gly Ser Thr Leu Gin Pro Ser Lys Leu Arg Lys Gly Arg Lys Pro 
1700 1705 1710 

Ala Ala Glu Ala Gly Gly Ala Trp Arg Pro Glu Lys Arg Gly Thr Thr 
1715 1720 1725 

Ser Thr Lys He Asn Gly Ser Pro Arg Leu Pro Asn Gly Pro Glu Lys 
1730 1735 1740 

Ala Lys Gly Thr Gin Lys Met Met Ala Gly Glu Ser Thr Met Leu Arg 
1745 ' 1750 1755 1760 

Gly Arg Thr Val He Tyr Ser Ala Gly Pro Ala Ser Arg Thr Gin Ser 
1765 1770 1775 

Lys Gly He Ser Gly Pro Cys Thr Thr Pro Lys Lys Thr Gly Thr Ser 
1780 1785 1790 

Gly Thr Thr Gin Pro Glu Thr Val Thr Lys Ala Pro Ser Pro Glu Gin 
1795 1800 1805 

Gin Arg Ser Arg Ser Leu His Arg Pro Gly Lys He Ser Glu Leu Ala 
1810 1815 1820 

Ala Leu Arg His Pro Pro Arg Ser Ala Thr Pro Pro Ala Arg Leu Ala 
1825 " 1830 ' 1835 1840 

Lys Thr Pro Ser Ser Ser Ser Ser Gin Thr Ser Pro Ala Ser Gin Pro 
1845 1850 1855 

Leu Pro Arg Arg Ser Pro Leu Ala Thr Pro Thr Gly Gly Pro Leu Pro 
1860 1865 1870 

Gly Pro Gly Gly Ser Leu Val Pro Lys Ser Pro Ala Arg Ala Leu Leu 
1875 ~ 1880 1885 

Ala Lys Gin His Lys Thr Gin Lys Ser Pro Val Arg He Pro Phe Met 
1890 1895 1900 

Gin Arg Pro Ala Arg Arg Val Pro Pro Pro Leu Ala Arg Pro Ser Pro 
1905 1910 1915 1920 

Glu Pro Gly Ser Arg Gly Arg Ala Gly Ala Glu Gly Thr Pro Gly Ala 
1925 ' 1930 1935 

Arg Gly Ser Arg Leu Gly Leu Val Arg Met Ala Ser Ala Arg Ser Ser 
1940 1945 1950 

Gly Ser Glu Ser Ser Asp Arg Ser Gly Phe Arg Arg Gin Leu Thr Phe 
1955 1960 1965 

He Lys Glu Ser Pro Gly Leu Leu Arg Arg Arg Arg Ser Glu Leu Ser 
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1970 1975 1980 

Ser Ala Asp Ser Thr Ala Ser Thr Ser Gin Ala Ala Ser Pro Arg Arg 
1985 1990 1995 2000 

Gly Arg Pro Ala Leu Pro Ala Val Phe Leu Cys Ser Ser Arg Cys Asp 
2005 2010 2015 

Glu Leu Arg Val Ser Pro Arg Gin Pro Leu Ala Ala Gin Arg Ser Pro 
2020 " 2025 2030 

Gin Ala Lys Pro Gly Leu Ala Pro Leu Ala Pro Arg Arg Thr Ser Ser 
2035 ' 2040 2045 

Glu Ser Pro Ser Arg Leu Pro Val Arg Ala Ser Pro Gly Arg Pro Glu 
2050 2055 2060 

Thr Val Lys Arg Tyr Ala Ser Leu Pro His lie Ser Val Ser Arg Arg 
2065 ' 2070 2075 2080 

Ser Asp Ser Ala Val Ser Val Pro Thr* Thr Gin Ala Asn Ala Thr Arg 
2085 2090 2095 

Arg Gly Ser Asp Gly Glu Ala Arg Pro Leu Pro Arg Val Ala Pro Pro 
2100 2105 2110 

Gly Thr Thr Trp Arg Arg lie Lys Asp Glu Asp Val Pro His lie Leu 
2115 2120 2125 

Arg Ser Thr Leu Pro Ala Thr Ala Leu Pro Leu Arg Val Ser Ser Pro 
2130 2135 2140 

Glu Asp Ser Pro Ala Gly Thr Pro Gin Arg Lys Thr Ser Asp Ala Val 
2145 2150 2155 2160 

Val Gin Thr Glu Asp Val Ala Thr Ser Lys Thr Asn Ser Ser Thr Ser 
2165 2170 2175 

Pro Ser Leu Glu Ser Arg Asp Pro Pro Gin Ala Pro Ala Ser Gly Pro 
2180 2185 2190 

Val Ala Pro Gin Gly Ser Asp Val Asp Gly Pro Val Leu Thr Lys Pro 
2195 2200 2205 

Pro Ala Ser Ala Pro Phe Pro His Glu Gly Leu Ser Ala Val lie Ala 
2210 2215 2220 

Gly Phe Pro Thr Ser Arg His Gly Ser Pro Ser Arg Ala Ala Arg Val 
2225 2230 2235 2240 

Pro Pro Phe Asn Tyr Val Pro Ser Pro Met Ala Ala Ala Thr Met Ala 
2245 2250 2255 

Ser Asp Ser Ala Val Glu Lys Ala Pro Val Ser Ser Pro Ala Ser Leu 
2260 2265 2270 

Leu Glu 
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<210> 66 
<211> 942 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 66 

atgtacgaag 

gaagacggtc 

ctggaaaaaa 

cactctagtc 

aagaagatga 

attcaaattc 

cgaagccatc 

cgccagggag 

ggattcgcgc 

gtattggcaa 

gagttgaagc 

caccgtaacg 

ctgatttacg 

gttcatgaag 

gaagcttatg 

ccggacactt 



atcagtactg 
tagaacctat 
tgacctcaat 
aagctctgga 
caattcactt 
cagcttcaga 
tgtcaggacc 
ccaagtcgga 
ggaaactcaa 
ttcgaagatt 
aatctatggc 
acccctataa 
ccctttgtcg 
ggattaagca 
cgatggacat 
tggacaagtg 



cgtggggtat 
caccctggca 
gtgggatgga 
gtatctcgca 
tgcaatccgt 
cagaacttgc 
cttccaacta 
tattcatttt 
aaaagtggca 
cgaatcggtt 
ttattccaac 
cgcggcctat 
tgctggttac 
tactaataca 
agccggagcc 
tggaagattc 



aattttctgg 
tcccatgcca 
cctatatcag 
gaagtgcaca 
cagtcagcat 
tggaagttca 
tatccgagca 
attatggatg 
aatgagatga 
aatggaactt 
ggatatgaat 
ttcccgtcca 
acgtttcatg 
atttattcga 
agatatgtca 
aagatgtatt 



aaggaacagg 
cttcagatat 
ttgggatatt 
gatgtgatga 
tccaacaaac 
gagcggatca 
accttatgag 
cagatatgat 
tcgacggaaa 
atttgcctag 
gggaagttca 
gaatcaaagt 
ttccgtcaca 
aagccacaat 
gagaaatgga 
ag 



agactttcga 60 
gatgttaact 120 
tattgatttt 180 
ggagttcagg 240 
ttgcccaaaa 300 
atcctacctc 360 
aaatttggct 420 
tgttagtgag 480 
aagtaaaaaa 540 
aactcacttt 600 
agtaattctt 660 
tatgcactct 720 
cgttttcgat 780 
tgctcatcag 840 
cgaaaagtac 900 
942 



<210> 67 
<211> 313 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 67 

Met Tyr Glu Asp Gin Tyr Cys Val Gly Tyr Asn 
1 5 10 



Phe Leu Glu Gly Thr 
15 



Gly Asp Phe Arg Glu Asp Gly Leu Glu Pro lie Thr Leu Ala Ser His 
20 25 30 



Ala Thr Ser Asp Met Met Leu Thr 
35 40 



Leu Glu 



Asp Gly 
50 

Ala Leu 
65 



Pro lie Ser Val 



Glu Tyr Leu Ala 
70 



Gly lie 
55 



Phe lie 
Glu Val His Arg 



Lys Met Thr 
45 

Asp Phe His 
60 

Cys Asp Glu 
75 



Ser Met Trp 



Ser Ser Gin 



Glu Phe Arg 
80 



Lys Lys 
Thr Cys 
Phe Arg 



Gin Leu 
130 



Met Thr He His Phe Ala 
85 



Pro Lys He Gin 
100 

Ala Asp Gin Ser 
115 

Tyr Pro Ser Asn 



He Pro 



Tyr Leu 
120 

Leu Met 
135 



He Arg 
90 

Ala Ser 
105 

Arg Ser 
Arg Asn 



Gin Ser Ala 



Asp Arg Thr 



His Leu Ser 
125 

Leu Ala Arg 
140 



Phe Gin Gin 
95 

Cys Trp Lys 
110 

Gly Pro Phe 
Gin Gly Ala 
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Lys Ser Asp He His Phe He Met Asp Ala Asp Met He Val Ser Glu 
145 150 155 160 

Gly Phe Ala Arg Lys Leu Lys Lys Val Ala Asn Glu Met He Asd Gly 
165 170 175 

Lys Sej. Lys Lys Val Leu Ala lie arg Arg fhe Glu ser val Asn Gly 
180 185 190 

Thr Tyr Leu Pro Arg Thr His Phe Glu Leu Lys Gin Ser Met Ala Tyr 
195 200 205 

Ser Asn Gly Tyr Glu Trp Glu Val Gin Val He Leu His Arg Asn Asp 
210 215 220 

Pro Tyr Asn Ala Ala Tyr Phe Pro Ser Arg He Lys Val Met His Ser 
225 230 235 240 

Leu He Tyr Ala Leu Cys Arg Ala Gly Tyr Thr Phe His Val Pro Ser 
245 250 255 

His Val Phe Asp Val His Glu Gly He Lys His Thr Asn Thr He Tyr 
260 265 270 

Ser Lys Ala Thr He Ala His Gin Glu Ala Tyr Ala Met Asp He Ala 
275 280 285 

Gly Ala Arg Tyr Val Arg Glu Met Asp Glu Lys Tyr Pro Asp Thr Leu 
290 295 "* 300 

Asp Lys Cys Gly Arg Phe Lys Met Tyr 
305 310 



<210> 68 
<211> 4527 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 68 

atgcggctgc 

cagagtacaa 

atgcttgtcg 

ctgaaacaag 

gagacgattg 

ttgagtgaga 

ggggagactc 

aaagcttrat 

cgataccogg 

aaaattttcg 

ggagtataca 

gaggtggaaa 

cttcccaaat 

tcagtgttcg 

tctctagagc 

ggagtcaaga 

gaagcggcag 

ggaggaggtt 



taattctcaa 
atgcagctgt 
ccccagcagt 
ctacggatga 
ctgctcgagt 
tgcctgctca 
ttaacgcaga 
cagttttcat 
caattgtagt 
atccaagcag 
gcggcgagct 
cctcgaatgg 
ttgaagtgaa 
tcgatgcaaa 
ttccatggca 
aggaagagga 
ttgttttctc 
caattcgaat 



tatattattt 
ggtgtcaaca 
agtgcgaccc 
agacatgatt 
tataagcaat 
atcactaact 
actcatattt 
ccaaacagat 
gaaatcggac 
aaatctgatt 
ccaacttgcc 
agtccaggac 
tatcaagacg 
gtacacatat 
cagatggcat 
gctgatggtc 
gaatgatgag 
tgttgcctca 



gtggtttggc 
accgcggcgc 
gatcaaccgt 
gttcggattg 
ttgaagccgg 
cctaggcaaa 
gaaaacgaaa 
cgagcaatct 
ttgaaacctt 
tcacagacga 
gaagagactc 
aagtcgtcat 
tcttcgttta 
ggaaaaggcg 
gcgatggttc 
gagcgtactg 
ctcaagaggc 
gtcactgaag 



aaatacatgg 
cagttaaacc 
tctccgtttg 
aagtgagaac 
gaatcgcaca 
gctataagct 
atgagttaaa 
atcggcctgc 
acgtcggaaa 
ttggtgtaac 
tccttggaga 
ttacggtaga 
taactattaa 
ttgctggaaa 
ccactatcat 
taaagctgaa 
acaagttgct 
acatcactga 



cgtcatcgga 
agccacttat 
catgaacctt 
tgaacggaac 
gactgtcagc 
ctacatccgc 
gtacgatcaa 
ttctcttgtt 
cgctactatt 
tttggataga 
ttggtttatc 
tacatacgtg 
cgatgatttg 
ggccaaagta 
tgacgaaaat 
cagacaagga 
acatgaatgg 
aatcgaaaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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aacgctactc 
ggagacacct 
gacactcccg 
ccatacaatc 
gacgctcacg 
attgaggctc 
tcgctatatg 
aatgagggag 
ctgtcgacaa 
actgttaatt 
tcacggttga 
gactttaaag 
gtcgagccag 
cttcttgtgg 
aaagttgagc 
ccacggccat 
attggaggga 
gccctcttgt 
ccaggaatgg 
ccgacagttc 
ggagaagttg 
tttgcaatca 
tttcgtccat 
gcacttttgg 
aagtatgata 
gacgaggttg 
gcagtgtact 
attgcatccc 
aaagtagata 
aaaaatcttg 
gatgtgattg 
ccatatggat 
tacttaagag 
gagcagggaa 
ggcgatagtg 
cacgccaaac 
aactctcaac 
atgcagggag 
ttggagaatg 
gtatctggaa 
aagcaagctg 
gatgtgaagt 
atgttccagg 
gcccaaaatc 
gagctaggag 
tacgcagcag 
catactcact 
tcttcattga 
ttgagttatt 
gagatcaagg 
cgtcctggaa 
gatgcggaac 
aaggacgaca 
tcgttatact 
gtggattact 
tcgctgcaag 
tttgacgagt 
atcattgctg 



atcaaatttc 
tcaagccagg 
tgaaggcaac 
atgacacttc 
gcacctcggt 
attatgatat 
tcgaggcagc 
ctgttgatgt 
ttacctatca 
ccgaacatgc 
tcgtctatgc 
tagaaggaat 
gacagaatgt 
tagatcagag 
aagacttgga 
gggaggcaat 
gtgatgctca 
accgcgagcc 
ctgaagccgc 
gaaagttctt 
agatggaaat 
atgaggaaaa 
tttttattca 
tgcttgtttt 
aagactctgg 
gccaacaaaa 
tcccaattgt 
aaggtggtga 
gaaatattcc 
agcttatctg 
gtgatatgat 
gtggagaaca 
caacaaatag 
ttcagaggga 
ataaagctgg 
aatacgcatt 
aaatggagtc 
gtgctcagga 
gaatggagaa 
atgcgtatac 
gaaaggcatt 
ttgccagtgc 
cccgtccagt 
aaacctcaga 
gattcacttc 
tcacttattc 
catttgatat 
atgatgcagt 
cttactacag 
aaataagagc 
agtctaacat 
aggtgcatac 
ctaagatgaa 
ctgatgtcac 
acgatccgga 
agaaatgtgg 
cgacagtaac 
tgctccttat 



aacgttccgt 
actcacttat 
tttgccaaag 
atctctgcaa 
tctcaccctc 
tggtggcaag 
cgtgagccca 
aggaaagagt 
ggtcatgtct 
gaccattagc 
cattattgag 
attccaaaac 
gaagtttaag 
tgttctcttg 
gaattatgac 
tgatagaaag 
atcaatcttt 
tcaacgtgaa 
attcgccgct 
cccacacacc 
tgaagctccg 
tggtctcggt 
actcaatctt 
caattatatg 
ttacgatctc 
tgttcgtatt 
gccgtcttca 
tgcagtagag 
atttgtcatt 
gccaaacgac 
gggaccagtg 
gaatatgttg 
aaatgagagc 
gttgacgtac 
ttcaacgtgg 
cgtggatcct 
tggtgcattt 
cgggggtgta 
tggaaaggcg 
aatggctgta 
cgagaatttg 
tcagaaaaaa 
agacattgaa 
atctctctca 
cactcaagac 
cgacaaacac 
caacataaga 
ttcaattaat 
ggactctttg 
tggaaatcga 
ggctctagcc 
tttgacaagt 
cgtatatttc 
ttatcaagtt 
ggagcagctc 
ggaagactgt 
aggcacgtcc 
tgcatga 
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gaggaggtca 
aatgttgtag 
agagttcagg 
gaggaaaaag 
caaccaccaa 
gacaacttca 
acgaagagtt 
cttagcttct 
cgttcgaaca 
ttcccagcaa 
agtagtcaag 
caagtggcac 
gtcacatctg 
ttaaaaactg 
tctaataatg 
aagaggtcca 
tctaatgctg 
tttatgtcag 
ccgccaatgg 
tggatttggt 
gacaccatca 
gtggctccaa 
ccatatgctg 
gaaaaggagc 
ctgaagaaag 
gtgtccgttg 
attggagaga 
atgaacttga 
gacctcaaca 
gtggttgatg 
ttaaacaatg 
aacttggttc 
caattagaga 
aaacgtgctg 
ctcaccgcat 
aatgttattt 
gccgaacgtg 
gctctaacgg 
gtgacgtatt 
gtggcatatg 
aagaaacata 
gtggagaaac 
actacctctt 
attattcgtt 
actgtcatgg 
acaagtcaag 
aacgcaattg 
gccaatggaa 
aatgacgatg 
ctgcaattgg 
gaaattgatg 
attgaggatt 
aacccgctcg 
gccgatcaaa 
aaaatgacat 
tggccaccga 
tctggttttg 



aacttgatgt 
ttgcactgaa 
tatcaacatt 
aaacaaaaat 
tcaactgcac 
cagccacccc 
tcctgcagct 
cgttgaaggc 
ttgtggtgtc 
cggctaacat 
aagttcttgt 
tatcaattga 
acaagaactc 
gcaatgatat 
ttggaggagg 
tttggagacc 
ggcttgtagt 
taatgatgat 
gcgggtcgtc 
cggatctcaa 
cctcgtgggt 
cgacttcaaa 
tccgtcgtgg 
aagatgtaac 
atggaactgt 
ctggtggagg 
tcccggtaca 
gagttgatcc 
ataactcttc 
gatcgcagaa 
ctcataagct 
caaatatttt 
caaaggctat 
acaactcttt 
ttgtcgttcg 
cccgagcagt 
gtgaagttca 
catttgtgct 
tggaaaagca 
cacttcaatt 
agatcgttga 
taaaagaatc 
acgcagttct 
ggttagtatc 
ctcttcaagc 
taacaattct 
ttctccaatc 
ctggtgtggt 
ctccgttctt 
atttgtgttg 
ccttgagtgg 
tgcaaagggt 
gtggccgtcc 
aacctgccaa 
acgccgccaa 
tttccccatc 
gtgccaagtg 



tgaaaagcag 
acaaatggat 
ctacaactat 
tgtggaagtt 
ttctgctcgc 
aatatattcc 
tttggctgat 
tacccagccg 
tcaacaaatg 
ggccccaaag 
tgatgcactt 
caaacaagca 
ttttgttggg 
cactagagaa 
attcgggggt 
ttggtgggga 
tttgaccgac 
ggatggtgct 
gcctccaccg 
ctctactagt 
cgcctccact 
acttcgcgtg 
agaaaagttt 
agtgactctg 
agtaaggcgt 
aacatcgaaa 
tataagcgct 
acaaggatat 
ggatttctcc 
ggcaagatta 
cgtacaaatg 
ggtggtcaag 
caagtttatt 
ctcagctttt 
atcgttccat 
cgcattcttg 
tcacaaggat 
aatctcgatt 
tttggatgaa 
ggcaaaaagt 
gaaaagtgga 
gagagcgtat 
ttcttacctc 
ccaacgcaac 
tctctcttcc 
caacggaaaa 
ctatcaacta 
cttcgctcag 
ctgctcccag 
caactatacc 
ttaccgattc 
ggagatggaa 
ggtatgtctc 
cttccgatta 
gcaaacacga 
actgcctcca 
gtgtgctctt 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3O60 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4527 
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<210> 69 
<211> 4560 
<212> DNA 

<213> Caenorhabditis elegans 



< / lCG> 65 

atgcggctgc 

cagagtacaa 

atgcttgtcg 

ctgaaacaag 

gagacgattg 

ttgagtgaga 

ggggagactc 

aaagctttat 

cgataccggg 

aaaattttcg 

ggagtataca 

gaggtggaaa 

cttcccaaat 

tcagtgttcg 

tctctagagc 

ggagtcaaga 

gaagcggcag 

ggaggaggtt 

aacgctactc 

ggagacacct 

gacactcccg 

ccatacaatc 

gacgctcacg 

attgaggctc 

tcgctatatg 

aatgagggag 

ctgtcgacaa 

actgttaatt 

tcacggttga 

gactttaaag 

gtcgagccag 

cttcttgtgg 

aaagttgagc 

ccacggccat 

attggaggga 

gccctcttgt 

ggaggtctta 

gctccgccaa 

acctggattt 

ccggacacca 

ggtgtggctc 

cttccatatg 

atggaaaagg 

ctcctgaaga 

attgtgtccg 

tcaattggag 

gagatgaact 

attgacctca 

gacgtggttg 

gtgttaaaca 

ttgaacttgg 



taattctcaa 
atgcagctgt 
ccccagcagt 
ctacggatga 
ctgctcgagt 
tgcctgctca 
ttaacgcaga 
cagttttcat 
caattgtagt 
atccaagcag 
gcggcgagct 
cctcgaatgg 
ttgaagtgaa 
tcgatgcaaa 
ttccatggca 
aggaagagga 
ttgttttctc 
caattcgaat 
atcaaatttc 
tcaagccagg 
tgaaggcaac 
atgacacttc 
gcacctcggt 
attatgatat 
tcgaggcagc 
ctgttgatgt 
ttacctatca 
ccgaacatgc 
tcgtctatgc 
tagaaggaat 
gacagaatgt 
tagatcagag 
aagacttgga 
gggaggcaat 
gtgatgctca 
accgcgagcc 
cagtaatgat 
tgggcgggtc 
ggtcggatct 
tcacctcgtg 
caacgacttc 
ctgtccgtcg 
agcaagatgt 
aagatggaac 
ttgctggtgg 
agatcccggt 
tgagagttga 
acaataactc 
atggatcgca 
atgctcataa 
ttccaaatat 



tatattattt 
ggtgtcaaca 
agtgcgaccc 
agacatgatt 
tataagcaat 
atcactaact 
actcatattt 
ccaaacagat 
gaaatcggac 
aaatctgatt 
ccaacttgcc 
agtccaggac 
tatcaagacg 
gtacacatat 
cagatggcat 
gctgatggtc 
gaatgatgag 
tgttgcctca 
aacgttccgt 
actcacttat 
tttgccaaag 
atctctgcaa 
tctcaccctc 
tggtggcaag 
cgtgagccca 
aggaaagagt 
ggtcatgtct 
gaccattagc 
cattattgag 
attccaaaac 
gaagtttaag 
tgttctcttg 
gaattatgac 
tgatagaaag 
atcaatcttt 
tcaacgtgaa 
gatggatggt 
gtcgcctcca 
caactctact 
ggtcgcctcc 
aaaacttcgc 
tggagaaaag 
aacagtgact 
tgtagtaagg 
aggaacatcg 
acatataagc 
tccacaagga 
ttcggatttc 
gaaggcaaga 
gctcgtacaa 
tttggtggtc 



gtggtttggc 
accgcggcgc 
gatcaaccgt 
gttcggattg 
ttgaagccgg 
cctaggcaaa 
gaaaacgaaa 
cgagcaatct 
ttgaaacctt 
tcacagacga 
gaagagactc 
aagtcgtcat 
tcttcgttta 
ggaaaaggcg 
gcgatggttc 
gagcgtactg 
ctcaagaggc 
gtcactgaag 
gaggaggtca 
aatgttgtag 
agagttcagg 
gaggaaaaag 
caaccaccaa 
gacaacttca 
acgaagagtt 
cttagcttct 
cgttcgaaca 
ttcccagcaa 
agtagtcaag 
caagtggcac 
gtcacatctg 
ttaaaaactg 
tctaataatg 
aagaggtcca 
tctaatgctg 
tttatgtcag 
gctccaggaa 
ccgccgacag 
agtggagaag 
acttttgcaa 
gtgtttcgtc 
tttgcacttt 
ctgaagtatg 
cgtgacgagg 
aaagcagtgt 
gctattgcat 
tataaagtag 
tccaaaaatc 
ttagatgtga 
atgccatatg 
aagtacttaa 



aaatacatgg 
cagttaaacc 
tctccgtttg 
aagtgagaac 
gaatcgcaca 
gctataagct 
atgagttaaa 
atcggcctgc 
acgtcggaaa 
ttggtgtaac 
tccttggaga 
ttacggtaga 
taactattaa 
ttgctggaaa 
ccactatcat 
taaagctgaa 
acaagttgct 
acatcactga 
aacttgatgt 
ttgcactgaa 
tatcaacatt 
aaacaaaaat 
tcaactgcac 
cagccacccc 
tcctgcagct 
cgttgaaggc 
ttgtggtgtc 
cggctaacat 
aagttcttgt 
tatcaattga 
acaagaactc 
gcaatgatat 
ttggaggagg 
tttggagacc 
ggcttgtagt 
aaagaagact 
tggctgaagc 
ttcgaaagtt 
ttgagatgga 
tcaatgagga 
cattttttat 
tggtgcttgt 
ataaagactc 
ttggccaaca 
acttcccaat 
cccaaggtgg 
atagaaatat 
ttgagcttat 
ttggtgatat 
gatgtggaga 
gagcaacaaa 



cgtcatcgga 
agccacttat 
catgaacctt 
tgaacggaac 
gactgtcagc 
ctacatccgc 
gtacgatcaa 
ttctcttgtt 
cgctactatt 
tttggataga 
ttggtttatc 
tacatacgtg 
cgatgatttg 
ggccaaagta 
tgacgaaaat 
cagacaagga 
acatgaatgg 
aatcgaaaga 
tgaaaagcag 
acaaatggat 
ctacaactat 
tgtggaagtt 
ttctgctcgc 
aatatattcc 
tttggctgat 
tacccagccg 
tcaacaaatg 
ggccccaaag 
tgatgcactt 
caaacaagca 
ttttgttggg 
cactagagaa 
attcgggggt 
ttggtgggga 
tttgaccgac 
taatacaccc 
cgcattcgcc 
cttcccacac 
aattgaagct 
aaatggtctc 
tcaactcaat 
tttcaattat 
tggttacgat 
aaatgttcgt 
tgtgccgtct 
tgatgcagta 
tccatttgtc 
ctggccaaac 
gatgggacca 
acagaatatg 
tagaaatgag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 
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agccaattag 
tacaaacgtg 
tggctcaccg 
cctaatgtta 
tttgccgaac 
gtagctctaa 
gcggtgacgt 
gtagtggcat 
ttgaagaaac 
aaagtggaga 
gaaactacct 
tcaattattc 
gacactgtca 
cacacaagtc 
agaaacgcaa 
aatgccaatg 
ttgaatgacg 
cgactgcaat 
gccgaaattg 
agtattgagg 
ttcaacccgc 
gttgccgatc 
ctcaaaatga 
tgttggccac 
tcctctggtt 



agacaaaggc 
ctgacaactc 
catttgtcgt 
tttcccgagc 
gtggtgaagt 
cggcatttgt 
atttggaaaa 
atgcacttca 
ataagatcgt 
aactaaaaga 
cttacgcagt 
gttggttagt 
tggctcttca 
aagtaacaat 
ttgttctcca 
gaactggtgt 
atgctccgtt 
tggatttgtg 
atgccttgag 
atttgcaaag 
tcggtggccg 
aaaaacctgc 
catacgccgc 
cgatttcccc 
ttggtgccaa 



tatcaagttt 
tttctcagct 
tcgatcgttc 
agtcgcattc 
tcatcacaag 
gctaatctcg 
gcatttggat 
attggcaaaa 
tgagaaaagt 
atcgagagcg 
tctttcttac 
atcccaacgc 
agctctctct 
tctcaacgga 
atcctatcaa 
ggtcttcgct 
cttctgctcc 
ttgcaactat 
tggttaccga 
ggtggagatg 
tccggtatgt 
caacttccga 
caagcaaaca 
atcactgcct 
gtggtgtgct 
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attgagcagg 
tttggcgata 
catcacgcca 
ttgaactctc 
gatatgcagg 
attttggaga 
gaagtatctg 
agtaagcaag 
ggagatgtga 
tatatgttcc 
ctcgcccaaa 
aacgagctag 
tcctacgcag 
aaacatactc 
ctatcttcat 
cagttgagtt 
caggagatca 
acccgtcctg 
ttcgatgcgg 
gaaaaggacg 
ctctcgttat 
ttagtggatt 
cgatcgctgc 
ccatttgacg 
cttatcattg 



gaattcagag 
gtgataaagc 
aacaatacgc 
aacaaatgga 
gaggtgctca 
atggaatgga 
gaaatgcgta 
ctggaaaggc 
agtttgccag 
aggcccgtcc 
atcaaacctc 
gaggattcac 
cagtcactta 
actcatttga 
tgaatgatgc 
attcttacta 
aggaaataag 
gaaagtctaa 
aacaggtgca 
acactaagat 
actctgatgt 
actacgatcc 
aagagaaatg 
agtcgacagt 
ctgtgctcct 



<210> 70 
<211> 1508 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 70 

Met Arg Leu Leu lie Leu Asn lie Leu Phe Val 
15 10 



ggagttgacg 
tggttcaacg 
attcgtggat 
gtctggtgca 
ggacgggggt 
gaatggaaag 
tacaatggct 
attcgagaat 
tgctcagaaa 
agtagacatt 
agaatctctc 
ttccactcaa 
ttccgacaaa 
tatcaacata 
agtttcaatt 
cagggactct 
agctggaaat 
catggctcta 
tactttgaca 
gaacgtatat 
cacttatcaa 
ggaggagcag 
tggggaagac 
aacaggcacg 
tattgcatga 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 



Val Trp Gin lie His 
15 



Gly Val He Gly Gin Ser Thr Asn Ala Ala Val Val Ser Thr Thr Ala 
20 25 30 

Ala Pro Val Lys Pro Ala Thr Tyr Met Leu Val Ala Pro Ala Val Val 
35 40 45 

Arg Pro Asp Gin Pro Phe Ser Val Cys Met Asn Leu Leu Lys Gin Ala 
50 55 60 

Thr Asp Glu Asp Met He Val Arg He Glu Val Arg Thr Glu Arg Asn 
65 70 75 80 

Glu Thr He Ala Ala Arg Val lie Ser Asn Leu Lys Pro Gly He Ala 
85 90 95 

Gin Thr Val Ser Leu Ser Glu Met Pro Ala Gin Ser Leu Thr Pro Arg 
100 105 110 

Gin Ser Tyr Lys Leu Tyr He Arg Gly Glu Thr Leu Asn Ala Glu Leu 
115 120 " 125 



He Phe Glu Asn Glu Asn Glu Leu Lys Tyr Asp Gin Lys Ala Leu Ser 
130 135 140 
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Val Phe He Gin Thr Asp Arg Ala He Tyr Arg Pro Ala Ser Leu Val 
14 5 150 155 160 

Arg Tyr Arg Ala He Val Val Lys Ser Asp Leu Lys Pro Tyr Val Gly 
165 170 175 

Asn Ala Thr He Lys He Phe Asp Pro Ser Arg Asn Leu He Ser Gin 
180 185 190 

Thr lie Gly Val Thr Leu Asp Arg Gly Val Tyr Ser Gly Glu Leu Gin 
195 200 205 

Leu Ala Glu Glu Thr Leu Leu Gly Asp Trp Phe He Glu Val Glu Thr 
210 215 220 

Ser Asn Gly Val Gin Asp Lys Ser Ser Phe Thr Val Asp Thr Tyr Val 
225 230 235 * 240 

Leu Pro Lys Phe Glu Val Asn He Lys Thr Ser Ser Phe He Thr He 
245 250 255 

Asn Asp Asp Leu Ser Val Phe Val Asp Ala Lys Tyr Thr Tyr Gly Lys 
260 265 270 

Gly Val Ala Gly Lys Ala Lys Val Ser Leu Glu Leu Pro Trp His Arg 
275 280 285 

Trp His Ala Met Val Pro Thr He He Asp Glu Asn Gly Val Lys Lys 
290 295 300 

Glu Glu Glu Leu Met Val Glu Arg Thr Val Lys Leu Asn Arg Gin Gly 
305 310 315 320 

Glu Ala Ala Val Val Phe Ser Asn Asp Glu Leu Lys Arg His Lys Leu 
325 330 335 

Leu His Glu Trp Gly Gly Gly Ser He Arg He Val Ala Ser Val Thr 
340 345 350 

Glu Asp He Thr Glu He Glu Arg Asn Ala Thr His Gin He Ser Thr 
355 360 365 

Phe Arg Glu Glu Val Lys Leu Asp Val Glu Lys Gin Gly Asp Thr Phe 
370 375 380 

Lys Pro Gly Leu Thr Tyr Asn Val Val Val Ala Leu Lys Gin Met Asp 
385 390 395 400 

Asp Thr Pro Val Lys Ala Thr Leu Pro Lys Arg Val Gin Val Ser Thr 
405 410 415 

Phe Tyr Asn Tyr Pro Tyr Asn His Asp Thr Ser Ser Leu Gin Glu Glu 
420 425 430 

Lys Glu Thr Lys He Val Glu Val Asp Ala His Gly Thr Ser Val Leu 
435 440 445 
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Thr Leu Gin Pro Pro lie Asn Cys Thr Ser Ala Arg lie- Glu Ala His 
450 455 " 460 

Tvr Asp lie Gly Gly Lys Asp Asn Phe Thr Ala Thr Pro lie Tyr Ser 
465 470 475 480 

Ser Leu Tyr Val Glu Ala Ala Val Ser Pro Thr Lys Ser Phe Leu Gin 
485 490 495 

Leu Leu Ala Asp Asn Glu Gly Ala Val Asp Val Gly Lys Ser Leu Ser 
500 505 " 510 

Phe Ser Leu Lys Ala Thr Gin Pro Leu Ser Thr lie Thr Tyr Gin Val 
515 520 525 

Met Ser Arg Ser Asn lie Val Val Ser Gin Gin Met Thr Val Asn Ser 
530 535 540 

Glu His Ala Thr lie Ser Phe Pro Ala Thr Ala Asn Met Ala Pro Lys 
545 550 555 560 

Ser Arg Leu lie Val Tyr Ala lie lie Glu Ser Ser Gin Glu Val Leu 
565 570 575 

Val Asp Ala Leu Asp Phe Lys Val Glu Gly lie Phe Gin Asn Gin Val 
580 585 590 

Ala Leu Ser lie Asp Lys Gin Ala Val Glu Pro Gly Gin Asn Val Lys 
595 600 605 

Phe Lys Val Thr Ser Asp Lys Asn Ser Phe Val Gly Leu Leu Val Val 
610 615 620 

Asp Gin Ser Val Leu Leu Leu Lys Thr Gly Asn Asp lie Thr Arg Glu 
625 630 635 640 

Lys Val Glu Gin Asp Leu Glu Asn Tyr Asp Ser Asn Asn Val Gly Gly 
645 650 655 

Gly Phe Gly Gly Pro Arg Pro Trp Glu Ala lie Asp Arg Lys Lys Arg 
660 665 670 

Ser lie Trp Arg Pro Trp Trp Gly lie Gly Gly Ser Asp Ala Gin Ser 
675 680 685 

lie Phe Ser Asn Ala Gly Leu Val Val Leu Thr Asp Ala Leu Leu Tyr 
690 695 700 

Arg Glu Pro Gin Arg Glu Phe Met Ser Val Met Met Met Asp Gly Ala 
705 710 715 720 

Pro Gly Met Ala Glu Ala Ala Phe Ala Ala Pro Pro Met Gly Gly Ser 
725 730 735 

Ser Pro Pro Pro Pro Thr Val Arg Lys Phe Phe Pro His Thr Trp lie 
740 745 750 

Trp Ser Asp Leu Asn Ser Thr Ser Gly Glu Val Glu Met Glu lie Glu 
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755 760 765 

Ala Pro Asp Thr He Thr Ser Tro Val Ala Ser Thr Phe Ala He Asn 
770 775 780 

Glu Glu Asn Gly Leu Gly Val Ala Pro Thr Thr Ser Lys Leu Arg Val 

TOC —fn>r\ -rr\— — ^. — 

• v -* / Z7u t y o DUU 

Phe Arg Pro Phe Phe He Gin Leu Asn Leu Pro Tyr Ala Val Arg Arg 
805 810 * 815 

Gly Glu Lys Phe Ala Leu Leu Val Leu Val Phe Asn Tyr Met Glu Lys 
820 825 830 

Glu Gin Asp Val Thr Val Thr Leu Lys Tyr Asp Lys Asp Ser Gly Tyr 
835 840 845 

Asp Leu Leu Lys Lys Asp Gly Thr Val Val Arg Arg Asp Glu Val Gly 
850 855 860 

Gin Gin Asn Val Arg He Val Ser Val Ala Gly Gly Gly Thr Ser Lys 
865 870 875 880 

Ala Val Tyr Phe Pro lie Val Pro Ser Ser He Gly Glu He Pro Val 
885 890 895 

His He Ser Ala He Ala Ser Gin Gly Gly Asp Ala Val Glu Met Asn 
900 905 910 

Leu Arg Val Asp Pro Gin Gly Tyr Lys Val Asp Arg Asn He Pro Phe 
915 920 925 

Val lie Asp Leu Asn Asn Asn Ser Ser Asp Phe Ser Lys Asn Leu Glu 
930 935 940 

Leu lie Trp Pro Asn Asp Val Val Asp Gly Ser Gin Lys Ala Arg Leu 
945 950 955 960 

Asp Val lie Gly Asp Met Met Gly Pro Val Leu Asn Asn Ala His Lys 
965 970 975 

Leu Val Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Leu Asn Leu 
980 985 990 

Val Pro Asn lie Leu Val Val Lys Tyr Leu Arg Ala Thr Asn Arg Asn 
995 1000 1005 

Glu Ser Gin Leu Glu Thr Lys Ala lie Lys Phe lie Glu Gin Gly lie 
1010 1015 1020 

Gin Arg Glu Leu Thr Tyr Lys Arg Ala Asp Asn Ser Phe Ser Ala Phe 
1025 1030 1035 1040 

Gly Asp Ser Asp Lys Ala Gly Ser Thr Trp Leu Thr Ala Phe Val Val 
1045 1050 1055 

Arg Ser Phe His His Ala Lys Gin Tyr Ala Phe Val Asp Pro Asn Val 
1060 1065 1070 
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lie Ser Arg Ala Val Ala Phe Leu Asn Ser Gin Gin Met Glu Ser Gly 
1075 1080 1085 

Ala Phe Ala Glu Arg Gly Glu Val His His Lys Asp Met Gin Gly Gly 
1090 1095 1100 

Ala Gin Asp Gly Gly Val Ala Leu Thr Ala Phe Val Leu He Ser He 
1105 1110 1115 1120 

Leu Glu Asn Gly Met Glu Asn Gly Lys Ala Val Thr Tyr Leu Glu Lys 
1125 1130 1135 

His Leu Asp Glu Val Ser Gly Asn Ala Tyr Thr Met Ala Val Val Ala 
1140 1145 1150 

Tyr Ala Leu Gin Leu Ala Lys Ser Lys Gin Ala Gly Lys Ala Phe Glu 
1155 1160 1165 

Asn Leu Lys Lys His Lys He Val Glu Lys Ser Gly Asp Val Lys Phe 
1170 1175 1180 

Ala Ser Ala Gin Lys Lys Val Glu Lys Leu Lys Glu Ser Arg Ala Tyr 
1185 1190 1195 1200 

Met Phe Gin Ala Arg Pro Val Asp He Glu Thr Thr Ser Tyr Ala Val 
1205 1210 1215 

Leu Ser Tyr Leu Ala Gin Asn Gin Thr Ser Glu Ser Leu Ser He He 
1220 1225 1230 

Arg Trp Leu Val Ser Gin Arg Asn Glu Leu Gly Gly Phe Thr Ser Thr 
1235 1240 1245 

Gin Asp Thr Val Met Ala Leu Gin Ala Leu Ser Ser Tyr Ala Ala Val 
1250 1255 1260 

Thr Tyr Ser Asp Lys His Thr Ser Gin Val Thr He Leu Asn Gly Lys 
1265 1270 1275 1280 

His Thr His Ser Phe Asp lie Asn He Arg Asn Ala He Val Leu Gin 
1285 1290 1295 

Ser Tyr Gin Leu Ser Ser Leu Asn Asp Ala Val Ser He Asn Ala Asn 
1300 1305 1310 

Gly Thr Gly Val Val Phe Ala Gin Leu Ser Tyr Ser Tyr Tyr Arg Asp 
1315 1320 1325 

Ser Leu Asn Asp Asp Ala Pro Phe Phe Cys Ser Gin Glu lie Lys Glu 
1330 1335 1340 

He Arg Ala Gly Asn Arg Leu Gin Leu Asp Leu Cys Cys Asn Tyr Thr 
1345 1350 1355 1360 

Arg Pro Gly Lys Ser Asn Met Ala Leu Ala Glu He Asp Ala Leu Ser 
1365 1370 1375 
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Gly Tyr Arg Phe Asp Ala Glu Gin Val His Thr Leu Thr Ser He Glu 
1380 1385 1390 

Asp Leu Gin Arg Val Glu Met Glu Lys Asd Asp Thr Lvs Met Asn Val 
1395 1400 1405 

Tyr Phe Asn Pro .Leu Giy Gly Arg Pro val Cys Leu Ser Leu Tyr Ser 
1410 1415 1420 

Asp Val Thr Tyr Gin Val Ala Asp Gin Lys Pro Ala Asn Phe Arg Leu 
1425 1430 1435 1440 

Val Asp Tyr Tyr Asp Pro Glu Glu Gin Leu Lys Met Thr Tyr Ala Ala 
1445 1450 1455 

Lys Gin Thr Arg Ser Leu Gin Glu Lys Cys Gly Glu Asp Cys Trp Pro 
1460 1465 1470 

Pro He Ser Pro Ser Leu Pro Pro Phe Asp Glu Ser Thr Val Thr Gly 
1475 1480 1485 

Thr Ser Ser Gly Phe Gly Ala Lys Trp Cys Ala Leu He He Ala Val 
1490 1495 1500 

Leu Leu lie Ala 
1505 



<210> 71 
<211> 1519 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 71 

Met Arg Leu Leu He Leu Asn He Leu Phe Val Val Trp Gin He His 
15 10 15 

Gly Val He Gly Gin Ser Thr Asn Ala Ala Val Val Ser Thr Thr Ala 
20 25 30 

Ala Pro Val Lys Pro Ala Thr Tyr Met Leu Val Ala Pro Ala Val Val 
35 40 45 

Arg Pro Asp Gin Pro Phe Ser Val Cys Met Asn Leu Leu Lys Gin Ala 
50 55 60 

Thr Asp Glu Asp Met He Val Arg He Glu Val Arg Thr Glu Arg Asn 
65 70 75 80 

Glu Thr He Ala Ala Arg Val He Ser Asn Leu Lys Pro Gly He Ala 
85 90 95 

Gin Thr Val Ser Leu Ser Glu Met Pro Ala Gin Ser Leu Thr Pro Arg 
100 105 110 

Gin Ser Tyr Lys Leu Tyr He Arg Gly Glu Thr Leu Asn Ala Glu Leu 
115 120 125 
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He Phe Glu Asn Glu Asn Glu Leu Lys Tyr Asp Gin Lys Ala Leu Ser 

130 135 140 

Val Phe He Gin Thr Asp Arg Ala He Tyr Arg Pro Ala Ser Leu Val 
145 150 155 160 

Arg Tyr Arg Ala He Val Val Lys Ser Asp Leu Lys Pro Tyr Val Gly 
165 170 175 

Asn Ala Thr He Lys He Phe Asp Pro Ser Arg Asn Leu He Ser Gin 
180 185 190 

Thr He Gly Val Thr Leu Asp Arg Gly Val Tyr Ser Gly Glu Leu Gin 
195 200 205 

Leu Ala Glu Glu Thr Leu Leu Gly Asp Trp Phe He Glu Val Glu Thr 
210 215 220 

Ser Asn Gly Val Gin Asp Lys Ser Ser Phe Thr Val Asp Thr Tyr Val 
225 230 235 240 

Leu Pro Lys Phe Glu Val Asn He Lys Thr Ser Ser Phe He Thr He 
245 250 255 

Asn Asp Asp Leu Ser Val Phe Val Asp Ala Lys Tyr Thr Tyr Gly Lys 
260 265 270 

Gly Val Ala Gly Lys Ala Lys Val Ser Leu Glu Leu Pro Trp His Arg 
275 280 285 

Trp His Ala Met Val Pro Thr He lie Asp Glu Asn Gly Val Lys Lys 
290 295 300 

Glu Glu Glu Leu Met Val Glu Arg Thr Val Lys Leu Asn Arg Gin Gly 
305 310 315 320 

Glu Ala Ala Val Val Phe Ser Asn Asp Glu Leu Lys Arg His Lys Leu 
325 330 335 

Leu His Glu Trp Gly Gly Gly Ser He Arg He Val Ala Ser Val Thr 
340 "* 345 350 

Glu Asp He Thr Glu He Glu Arg Asn Ala Thr His Gin He Ser Thr 
355 360 365 

Phe Arg Glu Glu Val Lys Leu Asp Val Glu Lys Gin Gly Asp Thr Phe 
370 375 380 

Lys Pro Gly Leu Thr Tyr Asn Val Val Val Ala Leu Lys Gin Met Asp 
385 390 395 400 

Asp Thr Pro Val Lys Ala Thr Leu Pro Lys Arg Val Gin Val Ser Thr 
405 410 415 

Phe Tyr Asn Tyr Pro Tyr Asn His Asp Thr Ser Ser Leu Gin Glu Glu 
420 425 430 

Lys Glu Thr Lys He Val Glu Val Asp Ala His Gly Thr Ser Val Leu 
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435 440 445 

Thr Leu Gin Pro Pro lie Asn Cys Thr Ser Ala Arg lie Glu Ala His 
450 455 460 

Tyr Asp lie Gly Gly Lys Asp Asn Phe Thr Ala Thr Pro lie Tyr Ser 
46b 470 475 480 

Ser Leu Tyr Val Glu Ala Ala Val Ser Pro Thr Lys Ser Phe Leu Gin 
485 490 495 

Leu Leu Ala Asp Asn Glu Gly Ala Val Asp Val Gly Lys Ser Leu Ser 
500 505 510 

Phe Ser Leu Lys Ala Thr Gin Pro Leu Ser Thr lie Thr Tyr Gin Val 
515 520 525 

Met Ser Arg Ser Asn lie Val Val Ser Gin Gin Met Thr Val Asn Ser 
530 535 540 

Glu His Ala Thr He Ser Phe Pro Ala Thr Ala Asn Met Ala Pro Lys 
545 550 555 560 

Ser Arg Leu He Val Tyr Ala He He Glu Ser Ser Gin Glu Val Leu 
565 570 575 

Val Asp Ala Leu Asp Phe Lys Val Glu Gly He Phe Gin Asn Gin Val 
580 585 590 

Ala Leu Ser He Asp Lys Gin Ala Val Glu Pro Gly Gin Asn Val Lys 
595 600 605 

Phe Lys Val Thr Ser Asp Lys Asn Ser Phe Val Gly Leu Leu Val Val 
610 615 620 

Asp Gin Ser Val Leu Leu Leu Lys Thr Gly Asn Asp He Thr Arg Glu 
625 630 635 640 

Lys Val Glu Gin Asp Leu Glu Asn Tyr Asp Ser Asn Asn Val Gly Gly 
645 650 655 

Gly Phe Gly Gly Pro Arg Pro Trp Glu Ala He Asp Arg Lys Lys Arg 
660 665 ~ 670 

Ser He Trp Arg Pro Trp Trp Gly He Gly Gly Ser Asp Ala Gin Ser 
675 680 685 

He Phe Ser Asn Ala Gly Leu Val Val Leu Thr Asp Ala Leu Leu Tyr 
690 695 700 

Arg Glu Pro Gin Arg Glu Phe Met Ser Glu Arg Arg Leu Asn Thr Pro 
705 710 715 720 

Gly Gly Leu Thr Val Met Met Met Asp Gly Ala Pro Gly Met Ala Glu 
725 730 ' 735 

Ala Ala Phe Ala Ala Pro Pro Met Gly Gly Ser Ser Pro Pro Pro Pro 
740 745 750 
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Thr Val Arg Lys Phe Phe Pro His Thr Trp He Trp Ser Asp Leu Asn 
755 760 765 

Ser Thr Ser Gly Glu Val Glu Met Glu He Glu Ala Pro Asp Thr He 
770 775 780 

Thr Ser Trp Val Ala Ser Thr Phe Ala He Asn Glu Glu Asn Gly Leu 
785 790 795 800 

Glv Val Ala Pro Thr Thr Ser Lys Leu Arg Val Phe Arg Pro Phe Phe 
805 810 815 

He Gin Leu Asn Leu Pro Tyr Ala Val Arg Arg Gly Glu Lys Phe Ala 
820 825 830 

Leu Leu Val Leu Val Phe Asn Tyr Met Glu Lys Glu Gin Asp Val Thr 
835 840 845 

Val Thr Leu Lys Tyr Asp Lys Asp Ser Gly Tyr Asp Leu Leu Lys Lys 
850 855 860 

Aso Glv Thr Val Val Arg Arg Asp Glu Val Gly Gin Gin Asn Val Arg 
865 870 875 880 

He Val Ser Val Ala Gly Gly Gly Thr Ser Lys Ala Val Tyr Phe Pro 
885 " " 890 895 

He Val Pro Ser Ser He Gly Glu He Pro Val His He Ser Ala He 
900 905 910 

Ala Ser Gin Gly Gly Asp Ala Val Glu Met Asn Leu Arg Val Asp Pro 
915 920 925 

Gin Glv Tyr Lys Val Asp Arg Asn He Pro Phe Val He Asp Leu Asn 
930 935 940 

Asn Asn Ser Ser Asp Phe Ser Lys Asn Leu Glu Leu He Trp Pro Asn 
945 950 955 960 

Asp Val Val Asp Gly Ser Gin Lys Ala Arg Leu Asp Val He Gly Asp 
965 970 975 

Met Met Gly Pro Val Leu Asn Asn Ala His Lys Leu Val Gin Met Pro 
980 985 990 

Tvr Glv Cys Gly Glu Gin Asn Met Leu Asn Leu Val Pro Asn He Leu 
995 1000 1005 

Val Val Lys Tyr Leu Arg Ala Thr Asn Arg Asn Glu Ser Gin Leu Glu 
1010 1015 1020 

Thr Lys Ala He Lys Phe He Glu Gin Gly He Gin Arg Glu Leu Thr 
1025 1030 1035 1040 

Tvr Lvs Arg Ala Asp Asn Ser Phe Ser Ala Phe Gly Asp Ser Asp Lys 
1045 1050 1055 



BNSDOCID: <WO_0073328A2_I_> 



WO 00>73328 



PCT/EPOO/05108 



107 

Ala Gly Ser Thr Trp Leu Thr Ala Phe Val Val Arg Ser Phe His His 
1060 1065 1070 

Ala Lys Gin Tyr Ala Phe Val Asp Pro Asn Val He Ser Arg Ala Val 
1075 1080 1085 

Ala Phe Leu Asn Ser Gin Gin Met Glu Ser Gly Ala Phe Ala Glu Arg 
1090 1095 1100 

Gly Glu Val His His Lys Asp Met Gin Gly Gly Ala Gin Asp Gly Gly 
1105 mo 1115 H20 

Val Ala Leu Thr Ala Phe Val Leu He Ser He Leu Glu Asn Gly Met 
1125 1130 H35 

Glu Asn Gly Lys Ala Val Thr Tyr Leu Glu Lys His Leu Asp Glu Val 
1140 H45 1150 

Ser Gly Asn Ala Tyr Thr Met Ala Val Val Ala Tyr Ala Leu Gin Leu 
1155 1160 1165 

Ala Lys Ser Lys Gin Ala Gly Lys Ala Phe Glu Asn Leu Lys Lys His 
1170 H75 1180 

Lys He Val Glu Lys Ser Gly Asp Val Lys Phe Ala Ser Ala Gin Lys 
1185 1190 1195 1200 

Lys Val Glu Lys Leu Lys Glu Ser Arg Ala Tyr Met Phe Gin Ala Arg 
1205 1210 1215 

Pro Val Asp He Glu Thr Thr Ser Tyr Ala Val Leu Ser Tyr Leu Ala 
1220 1225 1230 

Gin Asn Gin Thr Ser Glu Ser Leu Ser He lie Arg Trp Leu Val Ser 
1235 1240 1245 

Gin Arg Asn Glu Leu Gly Gly Phe Thr Ser Thr Gin Asp Thr Val Met 
1250 1255 1260 

Ala Leu Gin Ala Leu Ser Ser Tyr Ala Ala Val Thr Tyr Ser Asp Lys 
1265 1270 1275 1280 

His Thr Ser Gin Val Thr He Leu Asn Gly Lys His Thr His Ser Phe 
1285 1290 1295 

Asp He Asn He Arg Asn Ala He Val Leu Gin Ser Tyr Gin Leu Ser 
1300 1305 1310 

Ser Leu Asn Asp Ala Val Ser He Asn Ala Asn Gly Thr Gly Val Val 
1315 1320 1325 

Phe Ala Gin Leu Ser Tyr Ser Tyr Tyr Arg Asp Ser Leu Asn Asp Asp 
1330 1335 1340 

Ala Pro Phe Phe Cys Ser Gin Glu He Lys Glu He Arg Ala Gly Asn 
1345 1350 1355 1360 

Arg Leu Gin Leu Asp Leu Cys Cys Asn Tyr Thr Arg Pro Gly Lys Ser 
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1365 1370 1375 

Asn Met Ala Leu Ala Glu lie Asp Ala Leu Ser Gly Tyr Arg Phe Asp 
1380 1385 1390 

Ala Glu Gin Val His Thr Leu Thr Ser lie Glu Asp Leu Gin Arg Val 
1395 1400 1405 

Glu Met Glu Lys Asp Asp Thr Lys Met Asn Val Tyr Phe Asn Pro Leu 
1410 ~ 1415 ^ 1420 

Gly Gly Arg Pro Val Cys Leu Ser Leu Tyr Ser Asp Val Thr Tyr Gin 
1425 1430 1435 1440 

Val Ala Asp Gin Lys Pro Ala Asn Phe Arg Leu Val Asp Tyr Tyr Asp 
1445 1450 1455 

Pro Glu Glu Gin Leu Lys Met Thr Tyr Ala Ala Lys Gin Thr Arg Ser 
1460 1465 1470 

Leu Gin Glu Lys Cys Gly Glu Asp Cys Trp Pro Pro lie Ser Pro Ser 
1475 1480 1485 

Leu Pro Pro Phe Asp Glu Ser Thr Val Thr Gly Thr Ser Ser Gly Phe 
1490 1495 1500 

Gly Ala Lys Trp Cys Ala Leu lie lie Ala Val Leu Leu lie Ala 
1505 1510 1515 



<210> 72 

<211> 1026 

<212> DNA 

<213> Caenorhabditis elegans 



<400> 72 

atgaatattg 

ggacttgaac 

aataagcctg 

agacaaattc 

gttaccgtac 

gtctcaccgt 

gctgtaggcg 

ggtgccaaat 

gcggaaaaga 

gttgttagaa 

aatgcaattg 

aaaatttcaa 

tgggaaatac 

ttgtacaatg 

ctgtgcagag 

atcaaattag 

agaaatagcg 

atgtaa 

<210> 73 
<211> 341 
<212> PRT 



tgtgggttat 
cagtaacact 
agaactggga 
tggattacgt 
actttgcatt 
caactctcga 
actcatttca 
cggatattca 
ttaaaccaat 
gatttgaaac 
aaaataaaca 
atatttcaca 
cttactcgag 
cggattactt 
ccaactatac 
gcgatactaa 
aactccagga 



aattttttgg 
agcagttcat 
tgggcctata 
cgctaaggtt 
ccgtctatca 
atgtggcgag 
attgtaccca 
tttcattgtc 
tgcaaatcaa 
aaatgaaacg 
agtcttccaa 
ctggtttgcg 
cagcttatgg 
tccagcgaga 
gttcaacctc 
cttctcaaaa 
tacataccca 



aagctacaga 
ggtactgcag 
tcatttggat 
tatagctgcg 
ccctttcaaa 
ttcctctcca 
agtaacttga 
gatggtgata 
attgttgatg 
acaatacccc 
tttcatcaca 
gtttccaatg 
gaagttcaag 
atcaaggtta 
ctatcacacg 
tccgttatag 
gatacgttgg 



aaggcatttt 
aaatgatgga 
tgtttattga 
atgaggagtt 
ctagctgccc 
acagaaaaaa 
tgagaaatat 
tgataatgag 
ggaaaaacaa 
acaaccatat 
gatttttctt 
aaaccgacga 
tgattcttca 
tgcagtcatt 
ttttcaacgt 
cacactctaa 
atagatgcgg 



ccgagaagat 
aatgatagaa 
ttttcattct 
tcagaaaaag 
acaaatcaaa 
gtttcgacgt 
tgcaagaaaa 
tgatggattt 
aaatgtattg 
agaattgaaa 
tgctgggcat 
aattactgct 
tcgtaacgac 
ggtgtacagc 
gcacaaagga 
gagaaatgga 
tcaatttgta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1026 
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<213> Caenorhabditis elegans 
<400> 73 

Met Asn lie Val Trp Val lie lie Phe Trp Lys Leu Gin Lys Gly lie 
15 10 15 

Phe Arg Glu Asp Gly Leu Glu Pro Val Thr Leu Ala Val His Gly Thr 
20 25 30 

Ala Glu Met Met Glu Met lie Glu Asn Lys Pro Glu Asn Trp Asp Gly 
35 40 4 5 

Pro lie Ser Phe Gly Leu Phe lie Asp Phe His Ser Arg Gin lie Leu 
50 55 ~ 60 

Asp Tyr Val Ala Lys Val Tyr Ser Cys Asp Glu Glu Phe Gin Lys Lys 
65 70 75 " 80 

Val Thr Val His Phe Ala Phe Arg Leu Ser Pro Phe Gin Thr Ser Cys 
85 90 95 

Pro Gin lie Lys Val Ser Pro Ser Thr Leu Glu Cys Gly Glu Phe Leu 
100 105 110 

Ser Asn Arg Lys Lys Phe Arg Arg Ala Val Gly Asp Ser Phe Gin Leu 
115 120 125 

Tyr Pro Ser Asn Leu Met Arg Asn lie Ala Arg Lys Gly Ala Lys Ser 
130 135 140 

Asp He His Phe He Val Asp Gly Asp Met He Met Ser Asp Gly Phe 
145 150 155 160 

Ala Glu Lys He Lys Pro lie Ala Asn Gin He Val Asp Gly Lys Asn 
165 170 ~ 175 

Lys Asn Val Leu Val Val Arg Arg Phe Glu Thr Asn Glu Thr Thr He 
180 185 190 

Pro His Asn His He Glu Leu Lys Asn Ala He Glu Asn Lys Gin Val 
195 200 205 

Phe Gin Phe His His Arg Phe Phe Phe Ala Gly His Lys He Ser Asn 
210 215 220 

He Ser His Trp Phe Ala Val Ser Asn Glu Thr Asp Glu He Thr Ala 
225 230 235 240 

Trp Glu He Pro Tyr Ser Ser Ser Leu Trp Glu Val Gin Val He Leu 
245 250 255 

His Arg Asn Asp Leu Tyr Asn Ala Asp Tyr Phe Pro Ala Arg He Lys 
260 265 270 

Val Met Gin Ser Leu Val Tyr Ser Leu Cys Arg Ala Asn Tyr Thr Phe 
275 280 285 

Asn Leu Leu Ser His Val Phe Asn Val His Lys Gly He Lys Leu Gly 
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290 295 300 



Asp Thr Asn Phe Ser Lys Ser Val lie Ala His Ser Lys Arg Asn Gly 
305 310 315 320 

Arg Asn Ser Glu Leu Gin Asp Thr Tyr Pro Asp Thr Leu Asp Arg Cys 
325 330 335 

Gly Gin Phe Val Met 
340 



<210> 74 
<211> 1869 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 74 

atgcagtaca 

ggggcgtttc 

acacgtctta 

cgcctacatc 

tggaatattt 

gcttggattc 

gaaatcatcg 

accaatattt 

atggtggaaa 

gcattgggac 

aatggatggg 

actgccatgt 

atacaaattc 

tgtccggaga 

tacgcatttt 

cgacgtagac 

ttagacaatt 

tgcatggtga 

tggcggcatc 

aaagccgtca 

actgaatcct 

gaacacgcaa 

tacggtacaa 

gcattggaaa 

gatttggtga 

ccaagttctc 

gtagcacaaa 

gaatgccctt 

atgcaactga 

caaaatcata 

tgccttcaca 

ctaagatga 



ttgtagcgtc 
cggaagaaga 
tgttcatgca 
tgataactga 
caaactgcga 
caaattcaca 
gcaatgacat 
ttgatttgtg 
atctttccga 
gtggatttaa 
caagcaaatg 
cagatcaaga 
catgtgctta 
ctccacttgc 
ttgacaagat 
gatcttttaa 
tccgaattat 
cgcagttttc 
ctatatctac 
cagaattgaa 
ggatgctaga 
actgcaaata 
taatcgatca 
tgacataccc 
tagaacattt 
acgtacctac 
atgtaaatta 
tttacgatca 
aaatgatgaa 
atgcttcaaa 
cgttaaaaaa 



atattttaca 

ttacattcgc 

gcattttaaa 

tgaaaatcac 

gtggtttttt 

ttattcaaaa 

tggaaagatt 

gaaacaattt 

ttggtatctt 

cactggaata 

gagagtagtg 

catatttaat 

caattatcaa 

gctacatttc- 

tcgaaaagcg 

aggaaataat 

cccaaatgca 

caaagatcga 

agcagtatac 

tcgcacagat 

ctctttatat 

tattttaatg 

aactggaaac 

gcagctgagg 

attgaacaaa 

aaatatttca 

tgaaaagaac 

gcggttcgga 

ttataagttc 

atcattgaaa 

taaatttatg 



atatggaact 
cttgcgtata 
agtgtattga 
cgatccgata 
cacaatctca 
tattatggtt 
atgttcatgg 
agaaacttta 
aacaaggatg 
atcatgtttg 
gctaacaaat 
gcatacattc 
cttggagcac 
aactcacaaa 
tttgatgaaa 
caaaaagata 
attggaagaa 
cttaatcatt 
gggaaggata 
atcacaatcc 
cccataaatt 
actgatgtag 
ctgaaacaaa 
ttaaatttat 
accattcaaa 
aaatggataa 
tatgaacctt 
ggatttggct 
ttggtttctc 
cgctggagac 
aaaaaaactg 



ttgtggacca 
ttatcggtgg 
agtactctga 
ttcatgagct 
cagagttcga 
tgagcaaact 
atgttgatat 
acaattccca 
gtaaaaagtc 
atttggataa 
atttacgaat 
atgactatcc 
tcacaaaatc 
acaagactgt 
tggatggttc 
tctgccatga 
tgacaaaacc 
ttctggaaag 
aagatttgct 
atctggtttt 
ttttgagaaa 
attttgtggt 
aagaagttct 
caaattttct 
cttttcggga 
agtcaaatcg 
attttgtaat 
ggaataaagt 
caactagttt 
gtgacccaca 
ctagtcgact 



cacacgtgtt 
aaattttatg 
ccatttcttt 
catgacgtca 
aaaacgagtt 
tttaattcct 
catttttcaa 
ggttttcgga 
ggtttggcca 
gctccggaaa 
tcacggaaaa 
gactgaaatt 
caaagaacta 
tggaaaaaat 
agatcttaaa 
atatttacca 
tgcagaactc 
tgcaaatgcg 
agacattgca 
tgaggaacca 
tgtagcaatt 
acttggagat 
ggtgatccct 
ctcgagaaaa 
aacaatttgg 
cacttatatg 
caaaaaagaa 
aactcatgta 
tatgattcat 
ttatcagaaa 
gggaatcaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1869 



<210> 75 
<211> 622 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 75 

Met Gin Tyr lie Val Ala Ser Tyr Phe Thr lie Trp Asn Phe Val Asp 
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15 10 15 

His Thr Arg Val Gly Ala Phe Pro Glu Glu Asp Tyr lie Arg Leu Ala 
20 25 30 

Tyr He He Gly Gly Asn Phe Met Thr Ara Leu Met Phe Met Gin His 
35 40 45 

Phe Lys Ser Val Leu Lys Tyr Ser Asp His Phe Phe Arg Leu His Leu 
50 55 60 

He Thr Asp Glu Asn His Arg Ser Asp He His Glu Leu Met Thr Ser 
65 70 75 80 

Trp Asn He Ser Asn Cys Glu Trp Phe Phe His Asn Leu Thr Glu Phe 
85 90 95 

Glu Lys Arg Val Ala Trp He Pro Asn Ser His Tyr Ser Lys Tyr Tyr 
100 105 110 

Gly Leu Ser Lys Leu Leu He Pro Glu lie He Gly Asn Asp He Gly 
115 120 125 

Lys He Met Phe Met Asp Val Asp lie lie Phe Gin Thr Asn lie Phe 
130 135 140 

Asp Leu Trp Lys Gin Phe Arg Asn Phe Asn Asn Ser Gin Val Phe Gly 
145 150 155 160 

Met Val Glu Asn Leu Ser Asp Trp Tyr Leu Asn Lys Asp Gly Lys Lys 
165 170 175 

Ser Val Trp Pro Ala Leu Gly Arg Gly Phe Asn Thr Gly lie lie Met 
180 185 190 

Phe Asp Leu Asp Lys Leu Arg Lys Asn Gly Trp Ala Ser Lys Trp Arg 
195 * * 200 205 

Val Val Ala Asn Lys Tyr Leu Arg lie His Gly Lys Thr Ala Met Ser 
210 215 220 

Asp Gin Asp lie Phe Asn Ala Tyr lie His Asp Tyr Pro Thr Glu He 
225 230 235 240 

He Gin lie Pro Cys Ala Tyr Asn Tyr Gin Leu Gly Ala Leu Thr Lys 
245 250 255 

Ser Lys Glu Leu Cys Pro Glu Thr Pro Leu Ala Leu His Phe Asn Ser 
260 265 270 

Gin Asn Lys Thr Val Gly Lys Asn Tyr Ala Phe Phe Asp Lys lie Arg 
275 280 285 

Lys Ala Phe Asp Glu Met Asp Gly Ser Asp Leu Lys Arg Arg Arg Arg 
290 295 300 

Ser Phe Lys Gly Asn Asn Gin Lys Asp lie Cys His Glu Tyr Leu Pro 
305 310 315 320 
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Leu Asp Asn Phe Arg He He Pro Asn Ala He Gly Arg Met Thr Lys 
325 330 335 

Pro Ala Glu Leu Cys Met Val Thr Gin Phe Ser Lys Asp Arg Leu Asn 
340 345 350 

His Phe Leu Glu Ser Ala Asn Ala Trp Arg His Pro He Ser Thr Ala 
355 360 365 

Val Tyr Gly Lys Asp Lys Asp Leu Leu Asp He Ala Lys Ala Val Thr 
370 375 380 

Glu Leu Asn Arg Thr Asp He Thr He His Leu Val Phe Glu Glu Pro 
385 390 395 400 

Thr Glu Ser Trp Met Leu Asp Ser Leu Tyr Pro He Asn Phe Leu Arg 
405 * 410 415 

Asn Val Ala He Glu His Ala Asn Cys Lys Tyr He Leu Met Thr Asp 
420 425 430 

Val Asp Phe Val Val Leu Gly Asp Tyr Gly Thr He lie Asp Gin Thr 
435 440 445 

Gly Asn Leu Lys Gin Lys Glu Val Leu Val He Pro Ala Leu Glu Met 
450 455 460 

Thr Tyr Pro Gin Leu Arg Leu Asn Leu Ser Asn Phe Leu Ser Arg Lys 
465 * 470 475 480 

Asp Leu Val lie Glu His Leu Leu Asn Lys Thr He Gin Thr Phe Arg 
485 490 495 

Glu Thr He Trp Pro Ser Ser His Val Pro Thr Asn He Ser Lys Trp 
500 505 510 

He Lys Ser Asn Arg Thr Tyr Met Val Ala Gin Asn Val Asn Tyr Glu 
515 520 525 

Lys Asn Tyr Glu Pro Tyr Phe Val lie Lys Lys Glu Glu Cys Pro Phe 
530 535 540 

Tyr Asp Gin Arg Phe Gly Gly Phe Gly Trp Asn Lys Val Thr His Val 
545 550 555 560 

Met Gin Leu Lys Met Met Asn Tyr Lys Phe Leu Val Ser Pro Thr Ser 
565 ~ 570 575 

Phe Met lie His Gin Asn His Asn Ala Ser Lys Ser Leu Lys Arg Trp 
580 585 590 

Arg Arg Asp Pro His Tyr Gin Lys Cys Leu His Thr Leu Lys Asn Lys 
595 ~ 600 605 

Phe Met Lys Lys Thr Ala Ser Arg Leu Gly lie Lys Leu Arg 
610 615 620 
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<210> 76 
<211> 417 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 76 

Met Val Ser Leu Gin Lys Ser He Gly Leu Leu Leu Leu Ser Ala He 
1 5 10 15 

He Gly Leu Val Phe Leu He Gin His Arg Lys Ser Tyr Thr Ser Ser 
20 25 30 

Asp Ala Leu Leu Glu Asn Gly Tyr Pro Asn Lys Tyr Tyr Thr He Glu 
35 40 ^ 45 

Asn Pro Ala Glu Glu Gly Glu Arg Arg Ser Tyr Ser He Gin Thr Glu 
50 55 60 

Met His Ala Asp Gin Tyr Cys He Ala Tyr Lys Phe Leu Glu Ala Thr 
65 70 75 80 

Glu Ser Phe Arg Glu Ala Asp Gly Leu Glu Pro Val Thr Leu Ala Thr 
85 90 95 

His Ala Thr Ala Asp Met He Glu Thr Val Glu Asn Met Thr Phe Leu 
100 105 110 

Trp Asp Gly Pro He Ser He Gly He Phe Val Asp Tyr His Ser Tyr 
115 120 125 

Asn Val Leu Glu Tyr Leu Ala Glu Val His Arg Cys Asp Val Ser Phe 
130 135 140 

Arg Arg Lys Met Asn Val His Phe Ala Phe Arg Arg Ser Pro Phe Gin 
145 150 155 160 

Thr Glu Cys Pro Leu lie Glu He Pro Gin Ser Asn Arg Ser Cys Gin 
165 170 175 

Glu Phe Phe Ala Thr His Thr Glu Leu Arg Asn Ala He Val Gly Pro 
180 185 190 

Phe Gin Leu Tyr Pro Ser Asn Leu Met Arg Asn He Ala Arg Lys Gly 
195 200 205 

Ala Gin Thr Asp Leu Gin Phe He Met Asp Gly Asp Met Val Pro Ser 
210 215 220 

Glu Gly Phe Ala Thr Lys He Lys Arg He Ala Asn Glu Val He Asp 
225 230 235 240 

Gly Lys Asn Lys Arg Val Leu Ala lie Arg Arg Phe Glu Thr Ser Asp 
245 250 255 

Thr Ala Glu He Pro Arg Asp His Leu Lys Leu Leu Lys Ser Lys Lys 
260 265 8 270 
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Leu His Lys 
275 

His lie Asp 
290 

Val Val Thr 
305 

Gin Thr He 



Ser Arg He 

Gly Tyr Thr 
355 

He Lys His 
370 

Glu Ala Tyr 
385 

Asp Asp Leu 
He 



Thr Phe Glu 

Gly Leu Asp 

Thr Lys Glu 
310 

Leu His Arg 
325 

Lys Val Met 
340 

Phe His Val 
Thr Asn Thr 



Ala Met Lys 
390 

Tyr Pro His 
405 



Phe His 
280 

Asp Trp 
295 

Val Ala 



Asn Asp 



His Ser 



Pro Thr 
360 

lie Tyr 
375 

Glu Ala 



Thr Leu 



114 

His Arg Tyr 



Phe Arg Thr 

Tyr Pro Gly 
315 

Pro Tyr Asn 
330 

Leu Val Tyr 
345 

His Val Phe 
Ser Lys Ala 



Gly Asp Arg 
395 

Ser Gin Cys 
410 



Phe Pro 
285 

Ser He 
300 

Tyr Leu 

Ala Asp 

Ala Leu 

Asp Ser 
365 

Thr He 
380 

Tyr He 
Gly Glu 



Glu Gly His 

His Ser Gly 

Trp Glu Val 
320 

Tyr Phe Pro 
335 

Cys Arg Ala 
350 

His Arg Gly 
Ala His Gin 



Lys Glu Met 
400 

Phe Ser Met 
415 



<210> 77 
<211> 1050 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 77 

atgcacgacg 

gaagacggct 

ttggggaata 

cattcccaaa 

agacagatga 

attcaagttc 

gaggaaattg 

cgcggagcaa 

tttgcaaaga 

ttggttgtta 

gaggagggaa 

catcaaattc 

gcatgggaga 

gatccatata 

aagttatgcc 

ggaatcaaag 

ggaagattga 

atgaaacgat 

<210> 78 
<211> 349 



agcagttttg 
tggaaccagt 
agcctttgaa 
aggccctgaa 
ctgtccactt 
ttgggtatca 
aggactcttt 
agtcggattt 
tggtaaaacc 
gacgtttcga 
ttgaaaatca 
ccaacttgat 
ttccatacac 
atgtagagta 
gtgcaaacta 
aagatgatac 
ggacgctttc 
gtgggcagtt 



tgttggttat 
aactcttgct 
ctgggatggg 
ttatatttcc 
tgcattccga 
ggattgtgcc 
tcaaatctac 
acacttgata 
aatcgcaaat 
gaccaacgaa 
taaaacattc 
ggaatggttc 
tggaaatgat 
ttttccttcg 
caccttcaat 
catgtactcg 
tcgatatgtc 
tttgttataa 



aacttcctgg 
attcatggca 
cctatttcat 
atgctacata 
atctcaccat 
acatttttac 
ccgataaacc 
atcgatacag 
cggatgattg 
aatgagctac 
cagttccatc 
gaaagatctc 
tgggaagttc 
cgagtcaagg 
ttgctctctc 
aaggttgtga 
actgaaattg 



aggccgaaga 
ctccagaagt 
ttggtctatt 
aatgtgatgc 
ctcaatccga 
agaaaagcaa 
taatgagaaa 
atatgatgat 
atgggaagaa 
caatgagctt 
acaaattctt 
acgcctctaa 
aaatcattct 
atatgcagtc 
atgtgttcaa 
ctgctcacac 
acaggaaata 



tacattccga 
tttgcaacta 
cgtagatttt 
agcttttaaa 
atgcccaatg 
gcagctcctt 
tattgctcgg 
gagcaccaac 
taagcaagtg 

tggggatctt 

tttcgttggt 
tgatgtggtg 
tcaccgcaac 
tttgatctac 
tgttcataaa 
aaaacggcaa 
cccggatacc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1050 
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<212> PRT 

<213> Caenorhabditis elegans 
<400> 78 

Met His Asp Glu Gin Phe Cys Val Gly Tyr Asn Phe Leu Glu Ala Glu 
15 10 15 

Asp Thr Phe Arg Glu Asp Gly Leu Glu Pro Val Thr Leu Ala He His 
20 25 30 

Gly Thr Pro Glu Val Leu Gin Leu Leu Gly Asn Lys Pro Leu Asn Trp 
35 40 45 

Asp Gly Pro He Ser Phe Gly Leu Phe Val Asp Phe His Ser Gin Lys 
50 55 60 

Ala Leu Asn Tyr He Ser Met Leu His Lys Cys Asp Ala Ala Phe Lys 
65 70 75 * 80 

Arg Gin Met Thr Val His Phe Ala Phe Arg He Ser Pro Ser Gin Ser 
85 90 95 

Glu Cys Pro Met He Gin Val Leu Gly Tyr Gin Asp Cys Ala Thr Phe 
100 105 110 

Leu Gin Lys Ser Lys Gin Leu Leu Glu Glu lie Glu Asp Ser Phe Gin 
115 120 125 

He Tyr Pro He Asn Leu Met Arg Asn He Ala Arg Arg Gly Ala Lys 
130 135 140 

Ser Asp Leu His Leu He lie Asp Thr Asp Met Met Met Ser Thr Asn 
145 150 155 160 

Phe Ala Lys Met Val Lys Pro lie Ala Asn Arg Met lie Asp Gly Lys 
165 170 175 

Asn Lys Gin Val Leu Val Val Arg Arg Phe Glu Thr Asn Glu Asn Glu 
180 185 190 

Leu Pro Met Ser Phe Gly Asp Leu Glu Glu Gly He Glu Asn His Lys 
195 200 205 

Thr Phe Gin Phe His His Lys Phe Phe Phe Val Gly His Gin lie Pro 
210 215 220 

Asn Leu Met Glu Trp Phe Glu Arg Ser His Ala Ser Asn Asp Val Val 
225 230 235 240 

Ala Trp Glu lie Pro Tyr Thr Gly Asn Asp Trp Glu Val Gin He He 
245 250 255 

Leu His Arg Asn Asp Pro Tyr Asn Val Glu Tyr Phe Pro Ser Arg Val 
260 265 270 

Lys Asp Met Gin Ser Leu lie Tyr Lys Leu Cys Arg Ala Asn Tyr Thr 
275 280 285 
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Phe Asn Leu Leu Ser His Val Phe 
290 295 

Asp Asp Thr Met Tyr Ser Lys Val 
305 * 310 

Gly Arg Leu Arg Thr Leu Ser Arg 
325 

Tyr Pro Asp Thr Met Lys Arg Cys 
340 



116 

Asn Val His Lys Gly lie Lys Glu 
300 

Val Thr Ala His Thr Lys Arg Gin 
315 320 

Tyr Val Thr Glu lie Asp Arg Lys 
330 335 

Gly Gin Phe Leu Leu 
345 



<210> 79 
<211> 1167 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 79 

atgctcaaga tttcctcaag atttactcca 
ctttgtttgt ggtttttgaa gaaatattct 
tatgaaaatg agttttgcat tggctacaat 
gacggcttgg agcctgtgac acttgccatt 
gagaagaagc catcaaactg ggatgggcct 
tcccagaagg ctctggaata tgtggcaatg 
aaagtcaccg ttcactatgt gttccgaact 
actcctgatg tgtcggtgaa ttgtgatgaa 
gaaataacct ccccgtttca aatctaccca 
ggagcaactt ctgatctaca cttgatagtc 
gcgagaaaag tgaagccaat cgcaaatcgc 
gtagttcgac gttttgagac aaacgaagat 
atgggatttg agaatcaaaa agtattcgag 
aaaattccag acgtggaaaa atggtttcac 
tgggaaattc catactcagg aaatgcatgg 
atgtacaacg ccgagtactt tccgtctaga 
ctctgccgag ccaactacac cttcaacttg 
atcaaagagg atgacacaat gtactcgaaa 
aggaatagag cattctcccg ctacgtccat 
cagcggtgcg ggaagtttga gatgtga 



tttgctttgt ttctcctatt ttcaattcta 60 
caagatcttt ctaggatctc tatagaactt 120 
ttcctggagg ctacagaaaa attccgagaa 180 
catgggacat ccgatgtcct tgaagtagtg 240 
atatcattcg ggatgtttgt tgactatcac 300 
cttcatcagt gtgataagga gttcggggag 360 
tctccttccc agatggattg tccagtgata 420 
tttcgtcgga atcgaaagca gctcctcaaa 480 
ataaacttga tgagaaatgt tgcccgccgt 540 
gacgctgata tgacaatgag ctctgatttt 600 
ataattgatg ggaaacagag acaagttttg 660 
gagattccac tggaagttga gcagctgaag 720 
ttccatcaca attttttctt tattgggcat 780 
gcctcgaaga cagaaaatga agtgactgcc 840 
gaagtgcaag tgattcttca ccggaacgac 900 
atccgagaca tgcagtcttt gatctacggt 960 
ctctctcacg tattcaatgt tcaccaaggc 1020 
gttgtcacag ctcactcgaa gcgatatgga 1080 
gagatgaata ctgcgtatcc gggaactatt 1140 

1167 



<210> 80 
<211> 388 
<212> PRT 

<213> Caenorhabditis elegans 



<400> 80 
Met Leu Lys lie 
1 

Phe Ser lie Leu 
20 

Leu Ser Arg lie 
35 

Tyr Asn Phe Leu 
50 



Ser Ser Arg Phe 
5 

Leu Cys Leu Trp 

Ser lie Glu Leu 
40 

Glu Ala Thr Glu 
55 



Thr Pro Phe Ala 
10 

Phe Leu Lys Lys 
25 

Tyr Glu Asn Glu 



Lys Phe Arg Glu 
60 



Leu Phe Leu Leu 
15 

Tyr Ser Gin Asp 
30 

Phe Cys lie Gly 
45 

Asp Gly Leu Glu 
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Pro Val Thr Leu Ala He His Gly Thr Ser Asp Val Leu Glu Val Val 
65 70 75 80 

Glu Lys Lys Pro Ser Asn Trp Asp Gly Pro He Ser Phe Gly Met Phe 
85 90 - 95 

Val Asp Tyr His Ser Gin Lys Ala Leu Glu Tyr Val Ala Met Leu His 
100 105 110 

Gin Cys Asp Lys Glu Phe Gly Glu Lys Val Thr Val His Tyr Val Phe 
115 120 125 

Arg Thr Ser Pro Ser Gin Met Asp Cys Pro Val He Thr Pro Asp Val 
130 135 140 

Ser Val Asn Cys Asp Glu Phe Arg Arg Asn Arg Lys Gin Leu Leu Lys 
145 150 155 160 

Glu He Thr Ser Pro Phe Gin He Tyr Pro He Asn Leu Met Arg Asn 
165 170 175 

Val Ala Arg Arg Gly Ala Thr Ser Asp Leu His Leu He Val Asp Ala 
180 185 190 

Asp Met Thr Met Ser Ser Asp Phe Ala Arg Lys Val Lys Pro He Ala 
195 200 205 

Asn Arg He He Asp Gly Lys Gin Arg Gin Val Leu Val Val Arg Arg 
210 215 220 

Phe Glu Thr Asn Glu Asp Glu lie Pro Leu Glu Val Glu Gin Leu Lys 
225 230 235 240 

Met Gly Phe Glu Asn Gin Lys Val Phe Glu Phe His His Asn Phe Phe 
245 250 255 

Phe He Gly His Lys He Pro Asp Val Glu Lys Trp Phe His Ala Ser 
260 265 270 

Lys Thr Glu Asn Glu Val Thr Ala Trp Glu He Pro Tyr Ser Gly Asn 
275 280 285 

Ala Trp Glu Val Gin Val He Leu His Arg Asn Asp Met Tyr Asn Ala 
290 295 300 

Glu Tyr Phe Pro Ser Arg lie Arg Asp Met Gin Ser Leu lie Tyr Gly 
305 310 315 320 

Leu Cys Arg Ala Asn Tyr Thr Phe Asn Leu Leu Ser His Val Phe Asn 
325 330 335 

Val His Gin Gly lie Lys Glu Asp Asp Thr Met Tyr Ser Lys Val Val 
340 345 350 

Thr Ala His Ser Lys Arg Tyr Gly Arg Asn Arg Ala Phe Ser Arg Tyr 
355 360 365 

Val His Glu Met Asn Thr Ala Tyr Pro Gly Thr lie Gin Arg Cys Gly 
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370 375 380 

Lys Phe Glu Met 
385 



<210> 81 
<211> 1275 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 81 

atgtgtactt 

tttgctgcat 

atatgcgtca 

tctgtacaat 

ctacacggaa 

cgaatctcac 

gtcacgtttt 

tacccgataa 

atagtcgata 

aatcggatga 

gaaaccgaac 

tttgaattcc 

tttgaaaatt 

gattgggaag 

tcgcgagtca 

aatttgctct 

tcgaaagtcg 

tgtagagaat 

aaatcgacaa 

ttttcaggag 

tttaaagccg 

tatggagaaa 



ttaaaaaatt 
cactcgttaa 
aaattgaaac 
tcaaaggtaa 
aaccagaaaa 
catctcaaac 
tccaaaagaa 
atctaatgag 
cagatatggt 
ttgatgggat 
ttccactgaa 
atcactcgtt 
cttacgcatc 
ttcaaataat 
gggatatgca 
ctcacgtatt 
tcactgctca 
tcccaagata 
gcagtaccca 
tattcagaat 
gaaaagcgta 
aataa 



cgatggagaa 
tctcgatctc 
ttctcacttt 
tgctcctggt 
taattataat 
cgaatgtcct 
cacagagctc 
aaatattgct 
aatgagtact 
gaataaacaa 
cttggacgaa 
cttttttgtt 
agaagaaacc 
tct tcaccgc 
gtctttgatt 
caatgttcac 
cacaaagcaa 
tgcttgtgaa 
gacactacaa 
gttcacgcaa 
ctgttttgtt 



actcgaaaaa 
aagcctgtga 
acctctggca 
tctgatgctg 
tccttacaga 
gtgatctata 
cttgaggaaa 
cgacgcggag 
aactttgcaa 
gtcttggttg 
cttgagcaag 
ggccatcaaa 
actgcatggg 
aacgacccat 
tataaactct 
aaggggatca 
ttttggaaaa 
tttacagaac 
caagataatt 
ttctcgaaat 
gtttctgtca 



ctcggatcca 
agctaaattc 
cttattatat 
aaggaaggtt 
cgactgtaca 
cttccgggta 
tggaggaccc 
caaagtcgga 
agatggtaaa 
ttcgacgctt 
ggcttctgaa 
tacccaactt 
agattccata 
ataacattga 
gccgtgcgaa 
aagaagatga 
tgaggtattt 
gctttcccgt 
tgccagatgt 
tttcagagca 
cttttctggt 



gattttgtat 
caatgcgaat 
taacttggca 
tttcaagaaa 
ctttgcattc 
taaggattgt 
ttttcagatc 
tttacacttg 
accagttgcg 
cgagaccaac 
tgagaacaca 
gtctgagtgg 
cacaggaagt 
gtacttccca 
ctacacattc 
tacaatgtac 
atttttttgt 
tacactgccg 
ttccttattt 
tttgaacatt 
gtctttaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1275 



<210> 82 
<211> 424 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 82 

Met Cys Thr Phe Lys Lys Phe Asp Gly Glu Thr 
15 10 



Arg Lys Thr Arg lie 
15 



Gin lie Leu Tyr Phe Ala Ala Ser Leu Val Asn Leu Asp Leu Lys Pro 

20 25 30 

Val Lvs Leu Asn Ser Asn Ala Asn lie Cys Val Lys lie Glu Thr Ser 

35 40 45 

His Phe Thr Ser Gly Thr Tyr Tyr lie Asn Leu Ala Ser Val Gin Phe 

50 55 * 60 

Lys Gly Asn Ala Pro Gly Ser Asp Ala Glu Gly Arg Phe Phe Lys Lys 

65 " 70 75 80 



Leu His Gly Lys Pro Glu Asn Asn Tyr Asn Ser Leu Gin Thr Thr Val 
85 90 95 
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His Phe Ala Phe Arg lie Ser Pro Ser Gin Thr Glu Cys Pro Val lie 

100 105 110 

Tyr Thr Ser Gly Tyr Lys Asp Cys Val Thr Phe Phe Gin Lys Asn Thr 
115 120 125 

Glu Leu Leu Glu Glu Met Glu Asp Pro Phe Gin He Tyr Pro He Asn 
130 135 140 

Leu Met Arg Asn He Ala Arg Arg Gly Ala Lys Ser Asp Leu His Leu 
145 150 155 160 

He Val Asp Thr Asp Met Val Met Ser Thr Asn Phe Ala Lys Met Val 
165 170 175 

Lys Pro Val Ala Asn Arg Met He Asp Gly Met Asn Lys Gin Val Leu 
180 185 190 

Val Val Arg Arg Phe Glu Thr Asn Glu Thr Glu Leu Pro Leu Asn Leu 
195 200 205 

Asp Glu Leu Glu Gin Gly Leu Leu Asn Glu Asn Thr Phe Glu Phe His 
210 215 220 

His Ser Phe Phe Phe Val Gly His Gin He Pro Asn Leu Ser Glu Trp 
225 230 235 240 

Phe Glu Asn Ser Tyr Ala Ser Glu Glu Thr Thr Ala Trp Glu He Pro 
245 250 255 

Tyr Thr Gly Ser Asp Trp Glu Val Gin He lie Leu His Arg Asn Asp 
260 265 270 

Pro Tyr Asn He Glu Tyr Phe Pro Ser Arg Val Arg Asp Met Gin Ser 
275 280 285 

Leu He Tyr Lys Leu Cys Arg Ala Asn Tyr Thr Phe Asn Leu Leu Ser 
290 295 300 

His Val Phe Asn Val His Lys Gly He Lys Glu Asp Asp Thr Met Tyr 
305 310 315 320 

Ser Lys Val Val Thr Ala His Thr Lys Gin Phe Trp Lys Met Arg Tyr 
325 330 335 

Leu Phe Phe Cys Cys Arg Glu Phe Pro Arg Tyr Ala Cys Glu Phe Thr 
340 345 350 

Glu Arg Phe Pro Val Thr Leu Pro Lys Ser Thr Ser Ser Thr Gin Thr 
355 360 365 

Leu Gin Gin Asp Asn Leu Pro Asp Val Ser Leu Phe Phe Ser Gly Val 
370 375 380 

Phe Arg Met Phe Thr Gin Phe Ser Lys Phe Ser Glu His Leu Asn He 
385 390 395 400 
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Phe Lys Ala Gly Lys Ala Tyr Cys Phe Val Val Ser Val Thr Phe Leu 
405 " 410 415 

Val Ser Leu Lys Tyr Gly Glu Lys 
420 



<210> 83 
<211> 370 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 83 

Met Glu Asp Asp Thr Pro Asp Val Ser Ser Asp Ser Asn Gly Asp Ala 
15 10 15 

Ala Tyr Ser Asp Tyr Phe Leu Asp Tyr Lys Ser He Met Asp Glu He 
20 25 30 

Thr He Thr Thr Gin Pro Lys Ser Gly Tyr Val He Arg Asn Lys Pro 
35 40 45 

Leu Arg Leu Gin Cys Arg Ala Asn His Ala Thr Lys He Arg Tyr Lys 
50 55 60 

Cys Ser Ser Lys Trp He Asp Asp Ser Arg He Glu Lys Leu He Gly 
65 ^ 70 75 80 

Thr Asp Ser Thr Ser Gly Val Gly Tyr He Asp Ala Ser Val Asp He 

85 90 95 

Ser Arg lie Asp Val Asp Thr Ser Gly His Val Asp Ala Phe Gin Cys 
100 105 110 

Gin Cys Tyr Ala Ser Gly Asp Asp Asp Gin Asp Val Val Ala Ser Asp 
115 120 125 

Val Ala Thr Val His Leu Ala Tyr Met Arg Lys His Phe Leu Lys Ser 
130 135 140 

Pro Val Ala Gin Arg Val Gin Glu Gly Thr Thr Leu Gin Leu Pro Cys 
145 150 155 160 

Gin Ala Pro Glu Ser Asp Pro Lys Ala Glu Leu Thr Trp Tyr Lys Asp 
165 170 175 

Gly Val Val Val Gin Pro Asp Ala Asn Val lie Arg Ala Ser Asp Gly 
180 185 190 

Ser Leu lie Met Ser Ala Ala Arg Leu Ser Asp Ser Gly Asn Tyr Thr 
195 200 205 

Cys Glu Ala Thr Asn Val Ala Asn Ser Arg Lys Thr Asp Pro Val Glu 
210 215 220 

Val Gin lie Tyr Val Asp Gly Gly Trp Ser Glu Trp Ser Pro Trp lie 
225 230 235 240 
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Gly Thr Cys His Val Asp Cys Pro Leu Leu Arg Gin His Ala His Arg 
245 250 255 

lie Arg Asp Pro His Asp Val Leu Pro His Gin Arg Arg Thr Arg Thr 
260 265 270 

Cys Asn Asn Pro Ala Pro Leu Asn Asp Gly Glu Tyr Cys Lys Gly Glu 
275 280 285 

Glu Glu Met Thr Arg Ser Cys Lys Val Pro Cys Lys Leu Asp Gly Gly 
290 295 300 

Trp Ser Ser Trp Ser Asp Trp Ser Ala Cys Ser Ser Ser Cys His Arg 
305 310 315 320 

Tyr Arg Thr Arg Ala Cys Thr Val Pro Pro Pro Met Asn Gly Gly Gin 
325 330 335 

Pro Cys Phe Gly Asp Asp Leu Met Thr Gin Glu Cys Pro Ala Gin Leu 
340 345 350 

Cys Thr Ala Asp Ser Ser Arg lie Val lie Ser Asp Thr Ala Val Tyr 
355 360 365 

Gly Ser 
370 



<210> 84 
<211> 20 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 84 

Val Ala Ser lie Phe lie Val Ala Ser Phe lie Leu Ala lie Leu Ala 
15 10 15 

Met Phe Cys Cys 
20 



<210> 85 
<211> 122 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 85 

Lys Arg Gly Asn Ser Lys Lys Ser Lys Pro Leu Lys Pro Gin Lys Met 
15 10 15 

Asn Ser Glu Lys Ala Gly Gly lie Tyr Tyr Ser Glu Pro Pro Gly Val 
20 25 30 

Arg Arg Leu Leu Leu Glu His Gin His Gly Thr Leu Leu Gly Glu Lys 
35 40 45 

lie Ser Ser Cys Ser Gin Tyr Phe Glu Pro Pro Pro Leu Pro His Ser 
50 55 60 
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Thr Thr Leu Arg Ser Gly Lys Ser Ala Phe Ser Gly Tyr Ser Ser Thr 
65 70 75 80 

Arg Asn Ala Gly Ser Arg Ala Ala Leu He Gin Glu Cys Ser Ser Ser 

85 90 95 

Ser Ser Gly Ser Gly Gly Lys Arg Thr Met Leu Arg Thr Ser Ser Ser 
100 " 105 HO 

Asn Cys Ser Asp Asp Asp Asn Tyr Ala Thr 
115 120 



<210> 86 
<211> 165 
<212> PRT 

<213> Caenorhabditis elegans 

<400> 86 . 
Leu Tyr Asp Tyr Met Glu Asp Lys Ser Val Leu Gly Leu Asp Thr Ser 
1 5 10 I 5 

Gin Asn He Val Ala Ala Gin He Asp Ser Asn Gly Ala Arg Leu Ser 
20 25 30 

Leu Ser Lys Ser Gly Ala Arg Leu He Val Pro Glu Leu Ala Val Glu 
35 40 45 

Glv Glu Lys Met Leu Tyr Leu Ala Val Ser Asp Thr Leu Thr Asp Gin 
50 55 60 

Pro His Leu Lys Pro He Glu Ser Ala Leu Ser Pro Val He Val lie 
65 70 75 80 

Gly Gin Cys Asp Val Ser Met Ser Ala His Asp Asn He Leu Arg Arg 

85 90 95 

Pro Val Val Val Ser Phe Arg His Cys Ala Ser Thr Phe Pro Arg Asp 
100 105 HO 

Asn Trp Gin Phe Thr Leu Tyr Ala Asp Glu Gly Ser Gly Trp Gin Lys 
115 120 125 

Ala val Thr He Gly Glu Glu Asn Leu Asn Thr Asn Met Phe Val Gin 
130 ' 135 140 

Phe Glu Gin Pro Gly Lys Lys Asn Asp Gly Phe Gly Trp Cys His Val 
145 150 155 160 

Met Thr Tyr Ser Leu 
165 



<210> 87 
<211> 157 
<212> PRT 

<213> Caenorhabditis elegans 
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<400> 87 

Ala Arg Leu Met Leu Ala Gly His Pro Arg Arg Asn Ser Leu Ser Ala 
15 10 15 

Ala Lys Arg Val His Leu Ala Val Phe Gly Pro Thr Glu Met Ser Ala 
20 25 30 

Tyr Arg Arg Pro Phe Glu Leu Arg Val Tyr Cys Val Pro Glu Thr Gly 
35 40 45 

Ala Ala Met Glu Ser Val Trp Lys Gin Glu Asp Gly Ser Arg Leu Leu 
50 55 60 

Cys Glu Ser Asn Asp Phe lie Leu Asn Glu Lys Gly Asn Leu Cys He 
65 70 7 5 80 

Cys He Glu Asp Val He Pro Gly Phe Ser Cys Asp Gly Pro Glu Val 
85 90 ' 95 

Val Glu He Ser Glu Thr Gin His Arg Phe Val Ala Gin Asn Gly Leu 
100 105 HO 

His Cys Ser Leu Lys Phe Arg Pro Lys Glu He Asn Gly Ser Gin Phe 
115 120 125 

Ser Thr Arg Val He Val Tyr Gin Lys Ala Ser Ser Thr Glu Pro Met 
130 135 140 

Val Met Glu Val Ser Asn Glu Pro Glu Leu Tyr Asp Ala 
145 150 155 



<210> 88 
<211> 113 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 88 

Thr Ser Glu Glu Arg Glu Lys Gly Ser Val Cys Val Glu Phe Arg Leu 
1 5 10 15 

Pro Phe Gly Val Lys Asp Glu Leu Ala Arg Leu Leu Asp Met Pro Asn 
20 25 30 

Glu Ser His Ser Asp Trp Arg Gly Leu Ala Lys Lys Leu His Tyr Asp 
35 40 45 

Arg Tyr Leu Gin Phe Phe Ala Ser Phe Pro Asp Cys Ser Pro Thr Ser 
50 55 60 

Leu Leu Leu Asp Leu Trp Glu Ala Ser Ser Ser Gly Ser Ala Arg Ala 
65 70 75 80 

Val Pro Asp Leu Leu Gin Thr Leu Arg Val Met Gly Arg Pro Asp Ala 
85 90 95 

Val Met Val Leu Glu Arg Phe Leu Ser Ala Phe Pro Gin He Val Ser 
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100 105 HO 



Pro 



<210> 89 
<211> 437 
<212> PRT 

<213> Homo sapiens 
<400> 89 

His Met Ala Thr Leu His His Ser Ser Pro Thr Ser Glu Ala Glu Glu 
15 10 15 

Phe Val Ser Arg Leu Ser Thr Gin Asn Tyr Phe Arg Ser Leu Pro Arg 
20 25 30 

Gly Thr Ser Asn Met Thr Tyr Gly Thr Phe Asn Phe Leu Gly Gly Arg 
35 40 45 

Leu Met He Pro Asn Thr Gly He Ser Leu Leu He Pro Pro Asp Ala 
50 55 60 

He Pro Arg Gly Lys He Tyr Glu He Tyr Leu Thr Leu His Lys Pro 
65 "* 70 75 80 

Glu Asp Val Arg Leu Pro Leu Ala Gly Cys Gin Thr Leu Leu Ser Pro 

85 90 95 

He Val Ser Cys Gly Pro Pro Gly Val Leu Leu Thr Arg Pro Val He 
100 105 110 

Leu Ala Met Asp His Cys Gly Glu Pro Ser Pro Asp Ser Trp Ser Leu 
115 1 120 125 

Arg' Leu Lys Lys Gin Ser Cys Glu Gly Ser Trp Glu Asp Val Leu His 
130 1 135 140 

Leu Gly Glu Glu Ala Pro Ser His Leu Tyr Tyr Cys Gin Leu Glu Ala 
145 150 155 160 

Ser Ala Cys Tyr Val Phe Thr Glu Gin Leu Gly Arg Phe Ala Leu Val 
165 170 175 

Gly Glu Ala Leu Ser Val Ala Ala Ala Lys Arg Leu Lys Leu Leu Leu 
180 185 190 

Phe Ala Pro Val Ala Cys Thr Ser Leu Glu Tyr Asn He Arg Val Tyr 
195 200 205 

Cys Leu His Asp Thr His Asp Ala Leu Lys Glu Val Val Gin Leu Glu 
210 215 220 

Lys Gin Leu Gly Gly Gin Leu He Gin Glu Pro Arg Val Leu His Phe 
225 230 235 240 

Lys Asp Ser Tyr His Asn Leu Arg Leu Ser He His Asp Val Pro Ser 
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245 250 255 

Ser Leu Trp Lys Ser Lys Leu Leu Val Ser Tyr Gin Glu lie Pro Phe 
260 265 ' 270 

Tyr His lie Trp Asn Gly Thr Gin Arg Tyr Leu His Cys Thr Phe Thr 
275 280 285 

Leu Glu Arg Val Ser Pro Ser Thr Ser Asp Leu Ala Cys Lys Leu Trp 
290 295 300 

Val Trp Gin Val Glu Gly Asp Gly Gin Ser Phe Ser He Asn Phe Asn 
305 310 315 320 

He Thr Lys Asp Thr Arg Phe Ala Glu Leu Leu Ala Leu Glu Ser Glu 
325 330 335 

Ala Gly Val Gin Ala Leu Val Gly Pro Ser Ala Phe Lys He Pro Phe 
340 345 350 

Leu He Arg Gin Lys He lie Ser Ser Leu Asp Pro Pro Cys Arg Arg 
355 360 365 

Gly Ala Asp Trp Arg Thr Leu Ala Gin Lys Leu His Leu Asp Ser His 
370 375 380 

Leu Ser Phe Phe Ala Ser Lys Pro Ser Pro Thr Ala Met He Leu Asn 
385 390 395 400 

Leu Trp Glu Ala Arg His Phe Pro Asn Gly Asn Leu Ser Gin Leu Ala 
405 410 415 

Ala Ala Val Ala Gly Leu Gly Gin Pro Asp Ala Gly Leu Phe Thr Val 
420 425 430 

Ser Glu Ala Glu Cys 
435 



<210> 90 

<211> 931 

<212> PRT 

<213> Homo sapiens 

<400> 90 

Met Arg Lys Gly Leu Arg Ala Thr Ala Ala Arg Cys Gly Leu Gly Leu 
1 5 10 15 

Gly Tyr Leu Leu Gin Met Leu Val Leu Pro Ala Leu Ala Leu Leu Ser 
20 25 30 

Ala Ser Gly Thr Gly Ser Ala Ala Gin Aso Asp Asp Phe Phe His Glu 
35 40 45 

Leu Pro Glu Thr Phe Pro Ser Asp Pro Pro Glu Pro Leu Pro His Phe 
50 55 60 

Leu He Glu Pro Glu Glu Ala Tyr He Val Lys Asn Lys Pro Val Asn 
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65 70 75 80 

Leu Tyr Cys Lys Ala Ser Pro Ala Thr Gin lie Tyr Phe Lys Cys Asn 

85 90 ' 95 

Ser Glu Trp Val His Gin Lys Asp His lie Val Asp Glu Arg Val Asp 
100 105 110 

Glu Thr Ser Gly Leu lie Val Arg Glu Val Ser lie Glu lie Ser Arg 
115 120 125 

Gin Gin Val Glu Glu Leu Phe Gly Pro Glu Asp Tyr Trp Cys Gin Cys 
130 135 ' * 140 

Val Ala Trp Ser Ser Ala Gly Thr Thr Lys Ser Arg Lys Ala Tyr Val 
145 150 155 160 

Arg lie Ala Tyr Leu Arg Lys Thr Phe Glu Gin Glu Pro Leu Gly Lys 
165 170 175 

Glu Val Ser Leu Glu Gin Glu Val Leu Leu Gin Cys Arg Pro Pro Glu 
180 185 190 

Gly He Pro Val Ala Glu Val Glu Trp Leu Lys Asn Glu Asp He He 
195 200 205 

Asp Pro Val Glu Asp Arg Asn Phe Tyr He Thr He Asp His Asn Leu 
210 215 220 

He He Lys Gin Ala Arg Leu Ser Asp Thr Ala Asn Tyr Thr Cys Val 
225 230 235 240 

Ala Lys Asn He Val Ala Lys Arg Lys Ser Thr Thr Ala Thr Val He 
245 250 255 

Val Tyr Val Asn Gly Gly Trp Ser Thr Trp Thr Glu Trp Ser Val Cys 
260 265 270 

Asn Ser Arg Cys Gly Arg Gly Tyr Gin Lys Arg Thr Arg Thr Cys Thr 
275 280 285 

Asn Pro Ala Pro Leu Asn Gly Gly Ala Phe Cys Glu Gly Gin Ser Val 
290 295 300 

Gin Lys He Ala Cys Thr Thr Leu Cys Pro Val Asp Gly Arg Trp Thr 
305 310 315 320 

Pro Trp Ser Lys Trp Ser Thr Cys Gly Thr Glu Cys Thr His Trp Arg 
325 330 335 

Arg Arg Glu Cys Thr Ala Pro Ala Pro Lys Asn Gly Gly Lys Asp Cys 
340 345 350 

Asp Gly Leu Val Leu Gin Ser Lys Asn Cys Thr Asp Gly Leu Cys Met 
355 360 365 

Gin Thr Ala Pro Asp Ser Asp Asp Val Ala Leu Tyr Val Gly He Val 
370 375 380 
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He Ala Val He Val Cys Leu Ala He Ser Val Val Val Ala Leu Phe 
385 390 395 400 

Val Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp He He Asp Ser 
405 410 * 415 

Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn He Lys Ala Ala Arg 
420 425 430 

Gin Asp Leu Leu Ala Val Pro Pro Asp Leu Thr Ser Ala Ala Ala Met 
435 440 445 

Tyr Arg Gly Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys He Pro 
450 455 460 

Met Thr Asn Ser Pro lie Leu Asp Pro Leu Pro Asn Leu Lys He Lys 
465 470 475 480 

Val Tyr Asn Thr Ser Gly Ala Val Ser Pro Gin Asp Asp Leu Ser Glu 
485 490 * 495 

Phe Thr Ser Lys Leu Ser Pro Gin Met Thr Gin Ser Leu Leu Glu Asn 
500 505 510 

Glu Ala Leu Ser Leu Lys Asn Gin Ser Leu Ala Arg Gin Thr Asp Pro 
515 520 " 525 

Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Gly His Leu He 
530 535 540 

Val Pro Asn Ser Gly Val Ser Leu Leu He Pro Ala Gly Ala He Pro 
545 550 555 560 

Gin Gly Arg Val Tyr Glu Met Tyr Val Thr Val His Arg Lys Glu Thr 
565 570 575 

Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro Val Val 
580 585 590 

Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val Leu Thr 
595 600 605 

Met His His Cys Ala Asp Pro Asn Thr Glu Asp Trp Lys He Leu Leu 
610 615 620 

Lys Asn Gin Ala Ala Gin Gly Gin Trp Glu Asp Val Val Val Val Gly 
625 630 635 640 

Glu Glu Asn Phe Thr Thr Pro Cys Tyr He Lys Leu Asp Ala Glu Ala 
645 650 655 

Cys His He Leu Thr Glu Asn Leu Ser Thr Tyr Ala Leu Val Gly His 
660 665 670 



Ser Thr Thr Lys Ala Ala Ala Lys Arg Leu Lys Leu Ala He Phe Gly 
675 680 685 
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Pro Leu Cys Cys Ser Ser Leu Glu Tyr Ser lie Arg Val Tyr Cys Leu 
690 695 700 

Asp Asp Thr Gin Asp Ala Leu Lys Glu lie Leu His Leu Glu Arg Gin 
705 710 715 720 

Thr Gly Gly Gin Leu Leu Glu Glu Pro Lys Ala Leu His Phe Lys Gly 
725 730 735 

Ser Thr His Asn Leu Arg Leu Ser lie His Asp lie Ala His Ser Leu 
740 " 745 750 

Trp Lys Ser Lys Leu Leu Ala Lys Tyr Gin Glu lie Pro Phe Tyr His 
755 760 765 

Val Trp Ser Gly Ser Gin Arg Asn Leu His Cys Thr Phe Thr Leu Glu 
770 775 780 

Arg Phe Ser Leu Asn Thr Val Glu Leu Val Cys Lys Leu Cys Val Arg 
785 790 795 800 

Gin Val Glu Gly Glu Gly Gin lie Phe Gin Leu Asn Cys Thr Val Ser 
805 810 815 

Glu Glu Pro Thr Gly lie Asp Leu Pro Leu Leu Asp Pro Ala Asn Thr 
820 825 830 

lie Thr Thr Val Thr Gly Pro Ser Ala Phe Ser lie Pro Leu Pro lie 
835 840 845 

Arg Gin Lys Leu Cys Ser Ser Leu Asp Ala Pro Gin Thr Arg Gly His 
850 855 860 

Asp Trp Arg Met Leu Ala His Lys Leu Asn Leu Asp Arg Tyr Leu Asn 
865 870 875 880 

Tyr Phe Ala Thr Lys Ser Ser Pro Thr Gly Val lie Leu Asp Leu Trp 
885 890 895 

Glu Ala Gin Asn Phe Pro Asp Gly Asn Leu Ser Met Leu Ala Ala Val 
900 905 910 

Leu Glu Glu Met Gly Arg His Glu Thr Val Val Ser Leu Ala Ala Glu 
915 920 925 

Gly Gin Tyr 
930 



<210> 91 
<211> 9700 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
PGC1037 
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<400> 91 

tatggccaca 

ctccacccag 

cttcaacttc 

cccagatgcc 

agacgtgagg 

accccctggc 

cagccctgac 

tgtgctgcac 

tgcctgctac 

cgtggctgcc 

cgagtacaac 

gcagctggag 

ggacagttac 

taagctcctt 

gtacttgcac 

caagctgtgg 

caccaaggac 

cctggtgggc 

cctggaccca 

ggacagccat 

gtgggaggcg 

actgggccag 

gcagccaagc 

tataaaaaaa 

cttgttcttg 

cgctcttatt 

tttcctgtgt 

aagtgtaaag 

ctgcccgctt 

gcggggagag 

cgctcggtcg 

tccacagaat 

aggaaccgta 

catcacaaaa 

caggcgtttc 

ggatacctgt 

aggtatctca 

gttcagcccg 

cacgacttat 

ggcggtgcta 

tttggtatct 

tccggcaaac 

cgcagaaaaa 

tggaacgaaa 

tagatccttt 

tggtctgaca 

cgttcatcca 

ccatctggcc 

tcagcaataa 

gcctccatcc 

agtttgcgca 

atggcttcat 

tgcaaaaaag 

gtgttatcac 

agatgctttt 

cgaccgagtt 

ttaaaagrgc 



ctgcaccaca 
aactacttcc 
ctcgggggcc 
ataccccgag 
ttgcccctag 
gtcccgctca 
agctggagcc 
ctgggcgagg 
gtcttcaccg 
gccaagcgcc 
atccgggtct 
aagcagctgg 
cacaacctgc 
gtcagctacc 
tgcaccttca 
gtgtggcagg 
acaaggtttg 
cccagtgcct 
ccctgtaggc 
ctcagcttct 
cggcacttcc 
ccagacgctg 
taattccggg 
ataagtgtat 
agtaactctt 
gaccacacct 
gaaattgtta 
cctggggtgc 
tccagtcggg 
gcggtttgcg 
ttcggctgcg 
caggggataa 
aaaaggccgc 
atcgacgctc 
cccctggaag 
ccgcctttct 
gttcggtgta 
accgctgcgc 
cgccactggc 
cagagttctt 
gcgctctgct 
aaaccaccgc 
aaggatctca 
actcacgtta 
taaattaaaa 
gttaccaatg 
tagttgcctg 
ccagtgctgc 
accagccagc 
agtctattaa 
acgttgttgc 
tcagctccgg 
cggttagctc 
tcatggttat 
ctgtgactgg 
gctcttgccc 
tcatcattgg 



gctctcccac 
gctccctgcc 
ggctgatgat 
ggaagatcta 
ctggctgtca 
cccggccagt 
tgcgcctcaa 
aggcgccctc 
agcagctggg 
tcaagctgct 
actgcctgca 
ggggacagct 
gcctatccat 
aggagatccc 
ccctggagcg 
tggagggcga 
ctgagctgct 
tcaagatccc 
ggggtgccga 
ttgcctccaa 
ccaacggcaa 
gcctcttcac 
cgaatttctt 
acaaatttta 
tcctgtaggt 
ctaccggcat 
tccgctcaca 
ctaatgagtg 
aaacctgtcg 
tattgggcgc 
gcgagcggta 
cgcaggaaag 
gttgctggcg 
aagtcagagg 
ctccctcgtg 
cccttcggga 
ggtcgttcgc 
cttatccggt 
agcagccact 
gaagtggtgg 
gaagccagtt 
tggtagcggt 
agaagatcct 
agggattttg 
atgaagtttt 
cttaatcagt 
actccccgtc 
aatgataccg 
cggaagggcc 
ttgttgccgg 
cattgctaca 
ttcccaacga 
cttcggtcct 
ggcagcactg 
tgagtactca 
ggcgtcaata 
aaaacgttct 



129 

ctctgaggcc 
ccgaggcacc 
ccctaataca 
tgagatctac 
gaccctgctg 
catccrggcr 
aaagcagtcg 
ccacctctac 
ccgctttgcc 
tctgtttgcg 
tgacacccac 
gatccaggag 
ccacgatgtg 
cttttatcac 
tgtcagcccc 
cgggcagagc 
ggctctggag 
cttcctcatt 
ctggcggact 
gcccagcccc 
cctcagccag 
agtgtcggag 
atgatttatg 
aagtgactct 
caggttgctt 
gcaagcttgg 
attccacaca 
aggtaactca 
tgccagctgg 
tcttccgctt 
tcagctcact 
aacatgtgag 
tttttccata 
tggcgaaacc 
cgctctcctg 
agcgtggcgc 
tccaagctgg 
aactatcgtc 
ggtaacagga 
cctaactacg 
accttcggaa 
ggtttttttg 
ttgatctttt 
gtcatgagat 
aaatcaatct 
gaggcaccta 
gtgtagataa 
cgagacccac 
gagcgcagaa 
gaagctagag 
ggcatcgtgg 
tcaaggcgag 
ccgatcgttg 
cataattctc 
accaagtcat 
cgggataata 
tcggggcgaa 



gaggagttcg 
agcaacatga 
ggaatcagcc 
ctcacgctgc 
agtcccatcg 
atggaccact 
tgcgagggca 
tactgccagc 
ctggtgggag 
ccggtggcct 
gatgcactca 
ccacgggtcc 
cccagctccc 
atctggaatg 
agcactagtg 
ttcagcatca 
agtgaagcgg 
cggcagaaga 
ctggcccaga 
acagccatga 
ctggctgcag 
gctgagtgct 
atttttatta 
taggttttaa 
tctcaggtat 
cgtaatcatg 
acatacgagc 
cattaattgc 
attaatgaat 
cctcgctcac 
caaaggcggt 
caaaaggcca 
ggctccgccc 
cgacaggact 
ttccgaccct 
tttctcatag 
gctgtgtgca 
ttgagtccaa 
ttagcagagc 
gctacactag 
aaagagttgg 
tttgcaagca 
ctacggggtc 
tatcaaaaag 
aaagtatata 
tctcagcgat 
ctacgatacg 
gctcaccggc 
gtggtcctgc 
taagtagttc 
tgtcacgctc 
ttacatgatc 
tcagaagtaa 
ttactgtcat 
tctgagaata 
ccgcgccaca 
aactctcaag 



tctcccgcct 
cctatgggac 
tcctcatccc 
acaagccgga 
ttagctgtgg 
gtggggagcc 
gctgggagga 
tggaggccag 
aggccctcag 
gcacctccct 
aggaggtggt 
tgcacttcaa 
tgtggaagag 
gcacgcagcg 
acctggcctg 
acttcaacat 
gggtccaagc 
taatttccag 
aactccacct 
tcctcaacct 
cagtggctgg 
gagtcgacct 
ttaaataagt 
aacgaaaatt 
agcatgaggt 
gtcatagctg 
cggaagcata 
gttgcgctca 
cggccaacgc 
tgactcgctg 
aatacggtta 
gcaaaaggcc 
ccctgacgag 
ataaagatac 
gccgcttacc 
ctcacgctgt 
cgaacccccc 
cccggtaaga 
gaggtatgta 
aaggacagta 
tagctcttga 
gcagattacg 
tgacgctcag 
gatcttcacc 
tgagtaaact 
ctgtctattt 
ggagggctta 
tccagattta 
aactttatcc 
gccagttaat 
gtcgtttggt 
ccccatgttg 
gttggccgca 
gccatccgta 
gtgtatgcgg 
tagcagaact 
gatcttaccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 
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ctgttgagat 
actttcacca 
ataagggcga 
atttatcagg 
caaatagggg 
tcattttgta 
ctgcattttt 
tgcttcattt 
ctgagctgca 
atctatactt 
gcatcttaga 
tttgcactgt 
ataaaaaaag 
ttttttcaag 
tgtgaacaga 
cttctatttt 
attcactcta 
aacataaaaa 
taggttatat 
tgtggaagcg 
tttgaaagtg 
ttctagagaa 
cgaaaatgca 
gtgttgcctg 
gcgtacttat 
tatcccattc 
atgctgccac 
tggatcatat 
gaggcccttt 
cccggagacg 
cgcgtcagcg 
tgtactgaga 
aatagacagc 
aagatattct 
tctttttttg 
gagggcattg 
ggagtatgtc 
tgcagagcac 
tatgtgtgcc 
ttgtaaaagc 
atcaacctaa 
tgcatggaga 
gactcgtatt 
cgtttattcc 
cgatttctga 
gtggactgac 
atgtaagcgg 
tcaaaaatgc 
cttgccgatc 
ggaaattgta 
attttttaac 
gatagggttg 
caacgtcaaa 
ctaatcaagt 
cccccgattt 
agcgaaagga 
cacacccgcc 
gcgcaactgt 



ccagttcgat 
gcgtttctgg 
cacggaaatg 
gttattgtct 
ttccgcgcac 
gaacaaaaat 
acagaacaga 
ttgtaaaaca 
tttttacaga 
cttttttgtt 
ttactttttt 
aggtccgtta 
cctgactcca 
ataaaggcat 
aagtgatagc 
gtctctatat 
tgaatagttc 
atgtagaggt 
agggatatag 
gtattcgcaa 
cgtcttcaga 
taggaacttc 
acgcgagctg 
tatatatata 
atgcgtctat 
catgcggggt 
tcctcaattg 
taagaaacca 
cgtctcgcgc 
gtcacagctt 
ggtgttggcg 
gtgcaccata 
atcgtaatat 
ttattgaaaa 
ccgattaaga 
gtgactattg 
tgttattaat 
agaggccgca 
caatagaaag 
atataaaaat 
ggaggatgtt 
tgagtcgtgg 
tccaaaagac 
cttgtttgat 
ctgggttgga 
gccagaaaat 
aggtgtggag 
taagaaatag 
tatgcggtgt 
aacgttaata 
caataggccg 
agtgttgttc 
gggcgaaaaa 
tttttggggt 
agagcttgac 
gcgggcgcta 
gcgcttaatg 
tgggaagggc 



gtaacccact 
gtgagcaaaa 
ttgaatactc 
catgagcgga 
atttccccga 
gcaacgcgag 
aatgcaacgc 
aaaatgcaac 
acagaaatgc 
ctacaaaaat 
tctcctttgt 
aggttagaag 
cttcccgcgt 
ccccgattat 
gttgatgatt 
actacgtata 
ttactacaat 
cgagtttaga 
cacagagata 
tattttagta 
gcgcttttgg 
ggaataggaa 
cgcacataca 
tacatgagaa 
ttatgtagga 
atcgtatgct 
gattagtctc 
ttattatcat 
gtttcggtga 
gtctgtaagc 
ggtgtcgggg 
gatcaacgac 
atgtgtactt 
atagcttgtc 
attaattcgg 
agcacgtgag 
ttcacaggta 
gaatgtgctc 
agaacaattg 
agttcaggca 
ttggctctgg 
caagaatacc 
tgcaacatac 
tcagaagcag 
aggcaagaga 
gttggtgatg 
acaaatggtg 
gttattactg 
gaaataccgc 
ttttgttaaa 
aaatcggcaa 
cagtttggaa 
ccgtctatca 
cgaggtgccg 
ggggaaagcc 
gggcgctggc 
cgccgctaca 
gatcggtgcg 



130 
cgtgcaccca 
acaggaaggc 
atactcttcc 
tacatatttg 
aaagtgccac 
agcgctaatt 
gaaagcgcta 
gcgagagcgc 
aacgcgagag 
gcatcccgag 
gcgctctata 
aaggctactt 
ttactgatta 
attctatacc 
cttcattggt 
ggaaatgttt 
ttttttgtct 
tgcaagttca 
tatagcaaag 
gctcgttaca 
ttttcaaaag 
cttcaaagcg 
gctcactgtt 
gaacggcata 
tgaaaggtag 
tccttcagca 
atccttcaat 
gacattaacc 
tgacggtgaa 
ggatgccggg 
ctggcttaac 
attactatat 
tgcagttatg 
accttacgta 
tcgaaaaaag 
tatacgtgat 
gttctggtcc 
tagattccga 
acccggttat 
ctccgaaata 
tcaatgatta 
aagagttcct 
tactcagtgc 
gtgggacagg 
gccccgaaag 
cgcttagatt 
taaaagactc 
agtagtattt 
acagatgcgt 
attcgcgtta 
aatcccttat 
caagagtcca 
gggcgatggc 
taaagcacta 
ggcgaacgtg 
aagtgtagcg 
gggcgcgtcg 
ggcctcttcg 



actgatcttc 
aaaatgccgc 
tttttcaata 
aatgtattta 
ctgaacgaag 
tttcaaacaa 
ttttaccaac 
taatttttca 
cgctatttta 
agcgctattt 
atgcagtctc 
tggtgtctat 
ctagcgaagc 
gatgtggatt 
cagaaaatta 
acattttcgt 
aaagagtaat 
aggagcgaaa 
agatactttt 
gtccggtgcg 
cgctctgaag 
tttccgaaaa 
cacgtcgcac 
gtgcgtgttt 
tctagtacct 
ctacccttta 
gctatcattt 
tataaaaata 
aacctctgac 
agcagacaag 
tatgcggcat 
atataatata 
acgccagatg 
caatcttgat 
aaaaggagag 
taagcacaca 
attggtgaaa 
tgctgacttg 
tgcaaggaaa 
cttggttggc 
cggcattgat 
cggtttgcca 
agcttcacag 
tgaacttttg 
cttacatttt 
aaatggcgtt 
taacaaaata 
atttaagtat 
aaggagaaaa 
aatttttgtt 
aaatcaaaag 
ctattaaaga 
ccactacgtg 
aatcggaacc 
gcgagaaagg 
gtcacgctgc 
cgccattcgc 
ctattacgcc 



agcatctttt 
aaaaaaggga 
ttattgaagc 
gaaaaataaa 
catctgtgct 
agaatctgag 
gaagaatctg 
aacaaagaat 
ccaacaaaga 
ttctaacaaa 
ttgataactt 
tttctcttcc 
tgcgggtgca 
gcgcatactt 
tgaacggttt 
attgttttcg 
actagagata 
ggtggatggg 
gagcaatgtt 
tttttggttt 
ttcctatact 
cgagcgcttc 
ctatatctgc 
atgcttaaat 
cctgtgatat 
gctgttctat 
cctttgatat 
ggcgtatcac 
acatgcagct 
cccgtcaggg 
cagagcagat 
ggaagcattt 
gcagtagtgg 
ccggagcttt 
ggccaagagg 
aaggcagctt 
gtttgcggct 
ctgggtatta 
atttcaagtc 
gtgtttcgta 
atcgtccaac 
gttattaaaa 
aaacctcatt 
gattggaact 
atgttagctg 
attggtgttg 
gcaaatttcg 
tgtttgtgca 
taccgcatca 
aaatcagctc 
aatagaccga 
acgtggactc 
aaccatcacc 
ctaaagggag 
aagggaagaa 
gcgtaaccac 
cattcaggct 
agctggcgaa 



3480 
354G 
360C 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
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agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg 6960 
ttgtaaaacg acggccagtc gtccaagctt tcgcgagctc gagatcccga gctttgcaaa 7020 
ttaaagcctt cgagcgtccc aaaaccttct caagcaaggt tttcagtata atgttacatg 7080 
cgtacacgcg tctgtacaga aaaaaaagaa aaatttgaaa tataaataac gttcttaata 7140 
ctaacataac tataaaaaaa taaataggga cctagacttc aggttgtcta actccttcct 7200 
tttcggttag agcggatgtg gggggagggc gtgaatgtaa gcgtgacata actaattaca 7260 
3.3Ciggcaaag gggcctgttt aCtcacayyC "c t_ *c L L L u d a y caggtaatta 7320 
agtcgtttct gtctttttcc ttcttcaacc caccaaaggc catcttggta cttttttttt 7380 
tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 7440 
tttttttttt tttttttttt tttttcatag aaataataca gaagtagatg ttgaattaga 7500 
ttaaactgaa gatatataat ttattggaaa atacatagag ctttttgttg atgcgcttaa 7560 
gcgatcaatt caacaacacc accagcagct ctgatttttt cttcagccaa cttggagacg 7620 
aatctagctt tgacgataac tggaacattt ggaattctac ccttacccaa gatcttaccg 7680 
taaccggctg ccaaagtgtc aataactgga gcagtttcct tagaagcaga tttcaagtat 7740 
tggtctctct tgtcttctgg gatcaatgtc cacaatttgt ccaagttcaa gactggcttc 7800 
cagaaatgag cttgttgctt gtggaagtat ctcataccaa nccttaccga aataacctgg 7860 
atggtattta tccatgttaa ttctgtggtg atgttgacca ccggccatac ctctaccacc 7920 
ggggtgcttt ctgtgcttac cgatacgacc tttaccggct gagacgtgac ctctgtgctt 7980 
tctagtctta gtgaatctgg aaggcattct tgattagttg gatgattgtt ctgggattta 8040 
atgcaaaaaa atcactaaga aggaaaaaaa tcaacggaga aagcaaacgc catcttaaat 8100 
atacgggata cagatgaaag gtttgaacct atctgggaaa atacgcatta aacaagcgaa 8160 
aaactgcgag gaaaattgtt tgcgtctctg cgggctattc acgcgccaga ggaaaatagg 8220 
aaaaataaca gggcattaga aaaataattt tgattttggt aatgtgtggg tccctggtgt 8280 
acagatgtta cattggttac agtactcttg tttttgctgt gtttttcgat gaatctccaa 8340 
aatggttgtt agcacatgga agagtcaccg atgctaagtt atctctatgt aagctacgtg 8400 
gcgtgacttt tgatgaagcc gcacaagaga tacaggattg gcaactgcaa atagaatctg 8460 
gggatctaga tatccttttg ttgtttccgg gtgtacaata tggacttcct cttttctggc 8520 
aaccaaaccc atacatcggg attcctataa taccttcgtt ggtctcccta acatgtaggt 8580 
ggcggagggg agatatacaa tagaacagat accagacaag acataatggg ctaaacaaga 8 640 
ctacaccaat tacactgcct cattgatggt ggtacataac gaactaatac tgtagcccta 8700 
gacttgatag ccatcatcat atcgaagttt cactaccctt tttccatttg ccatctattg 8760 
aagtaataat aggcgcatgc aacttctttt cttttttttt cttttctctc tcccccgttg 8820 
ttgtctcacc atatccgcaa tgacaaaaaa aatgatggaa gacactaaag gaaaaaatta 8880 
acgacaaaga cagcaccaac agatgtcgtt gttccagagc tgatgagggg tatcttcgaa 8940 
cacacgaaac tttttccttc cttcattcac gcacactact ctctaatgag caacggtata 9000 
cggccttcct tccagttact tgaatttgaa ataaaaaaag tttgccgctt tgctatcaag 9060 
tataaataga cctgcaatta ttaatctttt gtttcctcgt cattgttctc gttccctttc 9120 
ttccttgttt ctttttctgc acaatatttc aagctatacc aagcatacaa tcaactccaa 9180 
gcttgaagca agcctcctga aagatgaagc tactgtcttc tatcgaacaa gcatgcgata 9240 
tttgccgact taaaaagctc aagtgctcca aagaaaaacc gaagtgcgcc aagtgtctga 9300 
agaacaactg ggagtgtcgc tactctccca aaaccaaaag gtctccgctg actagggcac 9360 
atctgacaga agtggaatca aggctagaaa gactggaaca gctatttcta ctgatttttc 9420 
ctcgagaaga ccttgacatg attttgaaaa tggattcttt acaggatata aaagcattgt 9480 
taacaggatt atttgtacaa gataatgtga ataaagatgc cgtcacagat agattggctt 9540 
cagtggagac tgatatgcct ctaacattga gacagcatag aataagtgcg acatcatcat 9600 
cggaagagag tagtaacaaa ggtcaaagac agttgactgt atcgccggaa ttcatggctt 9660 
acccatacga tgttccagat tacgctagct tgggtggtca " 9700 

<210> 92 
<211> 9749 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
PGC1003 

<400> 92 
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agaatctggg 

tttctggcaa 

atgtaggtgg 

aaacaagact 

tagccctaga 

atctattgaa 

ccccgttgtt 

aaaaattaac 

tcttcgaaca 

acggtatacg 

ctatcaagta 

tccctttctt 

aactccaagc 

atgcgatatt 

gtgtctgaag 

tagggcacat 

gatttttcct 

agcattgtta 

attggcttca 

atcatcatcg 

catggcttac 

gtgctgtaaa 

ggaaaaagca 

gcatcaacat 

tcctcctcta 

ttcaacaaga 

aggatctggt 

taattatgct 

acaaaatatt 

tggagcaaga 

tgtttcggac 

agtcatcgtc 

tcctgttgtt 

cacattgtat 

cttgaatacc 

atggtgccac 

aaattctttg 

agcttatcga 

ggaaagtgta 

tttaaatgaa 

cgatggccct 

ccttcattgc 

agtaattgtc 

gccggagctc 

atttcttatg 

aattttaaag 

tgtaggtcag 

ccggcatgca 

gctcacaatt 

atgagtgagg 

cctgtcgtgc 

tgggcgctct 

agcggtatca 

aggaaagaac 

gctggcgttt 

tcagaggtgg 

cctcgtgcgc 

ttcgggaagc 



gatctagata 
ccaaacccat 
cggaggggag 
acaccaatta 
cttgatagcc 
gtaataatag 
gtctcaccat 
gacaaagaca 
cacgaaactt 
gccttccttc 
taaatagacc 
ccttgtttct 
ttgaagcaag 
tgccgactta 
aacaactggg 
ctgacagaag 
cgagaagacc 
acaggattat 
gtggagactg 
gaagagagta 
ccatacgatg 
cgtggcaatt 
ggaggaatct 
ggaacattgc 
ccacattcaa 
aacgctggct 
ggaaaacgga 
acgttatatg 
gttgctgcgc 
ttaattgtac 
accttaacag 
atcggacaat 
gtatccttcc 
gctgatgaag 
aacatgtttg 
gttatgacat 
agtgctgcaa 
agaccatttg 
tggaaacagg 
aaaggaaatc 
gaagttgtgg 
tcattgaaat 
tatcaaaagg 
tatgatgcag 
atttatgatt 
tgactcttag 
gttgctttct 
agcttggcgt 
ccacacaaca 
taactcacat 
cagctggatt 
tccgcttcct 
gctcactcaa 
atgtgagcaa 
ttccataggc 
cgaaacccga 
tctcctgttc 
gtggcgcttt 



tccttttgtt 
acatcgggat 
atatacaata 
cactgcctca 
atcatcatat 
gcgcatgcaa 
atccgcaatg 
gcaccaacag 
tttccttcct 
cagttacttg 
tgcaattatt 
ttttctgcac 
cctcctgaaa 
aaaagctcaa 
agtgtcgcta 
tggaatcaag 
ttgacatgat 
ttgtacaaga 
atatgcctct 
gtaacaaagg 
ttccagatta 
caaaaaagtc 
actattctga 
tcggagagaa 
caactcttcg 
cacgagctgc 
caatgcttag 
attatatgga 
agattgatag 
ctgaattggc 
atcaacctca 
gtgatgtttc 
gtcattgtgc 
gatccggatg 
tacaattcga 
attccttagc 
aaagagttca 
aattgagagt 
aagatggatc 
tttgcatttg 
aaatctcgga 
tccgaccaaa 
caagctcaac 
ccccggggcg 
tttattatta 
gttttaaaac 
caggtatagc 
aatcatggtc 
tacgagccgg 
taattgcgtt 
aatgaatcgg 
cgctcactga 
aggcggtaat 
aaggccagca 
tccgcccccc 
caggactata 
cgaccctgcc 
ctcatagctc 
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gtttccgggt 
tcctataata 
gaacagatac 
ttgatggtgg 
cgaagtttca 
cttcttttct 
acaaaaaaaa 
atgtcgttgt 
tcattcacgc 
aatttgaaat 
aatcttttgt 
aatatttcaa 
gatgaagcta 
gtgctccaaa 
ctctcccaaa 
gctagaaaga 
tttgaaaatg 
taatgtgaat 
aacattgaga 
tcaaagacag 
cgctagcttg 
gaaaccattg 
gcctccaggt 
aatatcatcg 
atctggaaaa 
tttaattcaa 
aacttcaagc 
agacaagagt 
caatggagca 
agtggaagga 
cctcaaacca 
aatgtcagct 
ttcaactttt 
gcagaaggct 
gcaacctggg 
gaggttaatg 
tttagctgta 
ttattgtgtt 
tcgtctactt 
tattgaagat 
aacacaacac 
ggaaatcaac 
ggagccaatg 
tcgacctgca 
aataagttat 
gaaaattctt 
atgaggtcgc 
atagctgttt 
aagcataaag 
gcgctcactg 
ccaacgcgcg 
ctcgctgcgc 
acggttatcc 
aaaggccagg 
tgacgagcat 
aagataccag 
gcttaccgga 
acgctgtagg 



gtacaatatg 
ccttcgttgg 
cagacaagac 
tacataacga 
ctaccctttt 
ttttttttct 
tgatggaaga 
tccagagctg 
acactactct 
aaaaaaagtt 
ttcctcgtca 
gctataccaa 
ctgtcttcta 
gaaaaaccga 
accaaaaggt 
ctggaacagc 
gattctttac 
aaagatgccg 
cagcatagaa 
ttgactgtat 
ggtggtcata 
aaaccgcaga 
gttcgaagat 
tgttctcaat 
agtgcattca 
gaatgtagtt 
tcaaattgct 
gttcttggat 
cgattgtctt 
gagaaaatgc 
atcgaatccg 
catgataata 
ccacgtgaca 
gtaacaattg 
aagaagaatg 
ctagctggtc 
tttggtccaa 
cctgaaactg 
tgtgaatcaa 
gttattcctg 
agatttgttg 
ggctcacagt 
gtgatggaag 
gccaagctaa 
aaaaaaaata 
gttcttgagt 
tcttattgac 
cctgtgtgaa 
tgtaaagcct 
cccgctttcc 
gggagaggcg 
tcggtcgttc 
acagaatcag 
aaccgtaaaa 
cacaaaaatc 
gcgtttcccc 
tacctgtccg 
tatctcagtt 



gacttcctct 
tctccctaac 
ataatgggct 
actaatactg 
tccatttgcc 
tttctctctc 
cactaaagga 
atgaggggta 
ctaatgagca 
tgccgctttg 
ttgttctcgt 
gcatacaatc 
tcgaacaagc 
agtgcgccaa 
ctccgctgac 
tatttctact 
aggatataaa 
tcacagatag 
taagtgcgac 
cgccggaatt 
tggccatgga 
aaatgaattc 
tacttttgga 
attttgagcc 
gtggatattc 
cgagtagttc 
cggatgacga 
tagacacttc 
tatcaaaaag 
tttatctggc 
ctctctcccc 
ttctccgtcg 
attggcaatt 
gagaagagaa 
atggatttgg 
atccgaggag 
ctgaaatgtc 
gagcagcaat 
atgattttat 
gattctcatg 
ctcaaaatgg 
tttcaacgag 
tgagcaatga 
ttccgggcga 
agtgtataca 
aactctttcc 
cacacctcta 
attgttatcc 
ggggtgccta 
agtcgggaaa 
gtttgcgtat 
ggctgcggcg 
gggataacgc 
aggccgcgtt 
gacgctcaag 
ctggaagctc 
cctttctccc 
cggtgtaggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 
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cgttcgctcc aagctgggct gtgtgcacga 
atccggtaac tatcgtcttg agtccaaccc 
agccactggt aacaggatta gcagagcgag 
gtggtggcct aactacggct acactagaag 
gccagttacc ttcggaaaaa gagttggtag 
tagcggtggt ttttttgttt gcaagcagca 
agatcctttg atcttttcta cggggtctga 
gattttggtc atgagattat caaaaaggat 
aagttttaaa tcaatctaaa gtatatatga 
aatcagtgag gcacctatct cagcgatctg 
ccccgtcgtg tagataacta cgatacggga 
gataccgcga gacccacgct caccggctcc 
aagggccgag cgcagaagtg gtcctgcaac 
ttgccgggaa gctagagtaa gtagttcgcc 
tgctacaggc atcgtggtgt cacgctcgtc 
ccaacgatca aggcgagtta catgatcccc 
cggtcctccg atcgttgtca gaagtaagtt 
agcactgcat aattctctta ctgtcatgcc 
gtactcaacc aagtcattct gagaatagtg 
gtcaatacgg gataataccg cgccacatag 
acgttcttcg gggcgaaaac tctcaaggat 
acccactcgt gcacccaact gatcttcagc 
agcaaaaaca ggaaggcaaa atgccgcaaa 
aatactcata ctcttccttt ttcaatatta 
gagcggatac atatttgaat gtatttagaa 
tccccgaaaa gtgccacctg aacgaagcat 
acgcgagagc gctaattttt caaacaaaga 
gcaacgcgaa agcgctattt taccaacgaa 
atgcaacgcg agagcgctaa tttttcaaac 
gaaatgcaac gcgagagcgc tattttacca 
caaaaatgca tcccgagagc gctatttttc 
cctttgtgcg ctctataatg cagtctcttg 
ttagaagaag gctactttgg tgtctatttt 
cccgcgttta ctgattacta gcgaagctgc 
cgattatatt ctataccgat gtggattgcg 
gatgattctt cattggtcag aaaattatga 
acgtatagga aatgtttaca ttttcgtatt 
ctacaatttt tttgtctaaa gagtaatact 
gtttagatgc aagttcaagg agcgaaaggt 
agagatatat agcaaagaga tacttttgag 
tttagtagct cgttacagtc cggtgcgttt 
cttttggttt tcaaaagcgc tctgaagttc 
ataggaactt caaagcgttt ccgaaaacga 
acatacagct cactgttcac gtcgcaccta 
atgagaagaa cggcatagtg cgtgtttatg 
tgtaggatga aaggtagtct agtacctcct 
gtatgcttcc ttcagcacta ccctttagct 
tagtctcatc cttcaatgct atcatttcct 
ttatcatgac attaacctat aaaaataggc 
tcggtgatga cggtgaaaac ctctgacaca 
tgtaagcgga tgccgggagc agacaagccc 
gtcggggctg gcttaactat gcggcatcag 
caacgacatt actatatata taatatagga 
tgtactttgc agttatgacg ccagatggca 
gcttgtcacc ttacgtacaa tcttgatccg 
aattcggtcg aaaaaagaaa aggagagggc 
acgtgagtat acgtgattaa gcacacaaag 
acaggtagtt ctggtccatt ggtgaaagtt 
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accccccgtt cagcccgacc gctgcgcctt 3540 
ggtaagacac gacttatcgc cactggcagc 3600 
gtatgtaggc ggtgctacag agttcttgaa 3660 
gacagtattt ggtatctgcg ctctgctgaa 3720 
ctcttgatcc ggcaaacaaa ccaccgctgg 3780 
gattacgcgc agaaaaaaag gatctcaaga 3840 
cgctcagtgg aacgaaaact cacgttaagg 3900 
cttcacctag atccttttaa attaaaaatg 3960 
gtaaacttgg tctgacagtt accaatgctt 4020 
tctatttcgt tcatccatag ttgcctgact 4080 
gggcttacca tctggcccca gtgctgcaat 4140 
agatttatca gcaataaacc agccagccgg 4200 
tttatccgcc tccatccagt ctattaattg 4260 
agttaatagt ttgcgcaacg ttgttgccat 4320 
gtttggtatg gcttcattca gctccggttc 4380 
catgttgtgc aaaaaagcgg ttagctcctt 4440 
ggccgcagtg ttatcactca tggttatggc 4500 
atccgtaaga tgcttttctg tgactggtga 4560 
tatgcggcga ccgagttgct cttgcccggc 4 620 
cagaacttta aaagtgctca tcattggaaa 4680 
cttaccgctg ttgagatcca gttcgatgta 4740 
atcttttact ttcaccagcg tttctgggtg 4800 
aaagggaata agggcgacac ggaaatgttg 4860 
ttgaagcatt tatcagggtt attgtctcat 4920 
aaataaacaa ataggggttc cgcgcacatt 4980 
ctgtgcttca ttttgtagaa caaaaatgca 5040 
atctgagctg catttttaca gaacagaaat 5100 
gaatctgtgc ttcatttttg taaaacaaaa 5160 
aaagaatctg agctgcattt ttacagaaca 5220 
acaaagaatc tatacttctt ttttgttcta 5280 
taacaaagca tcttagatta ctttttttct 5340 
ataacttttt gcactgtagg tccgttaagg 5400 
ctcttccata aaaaaagcct gactccactt 5460 
gggtgcattt tttcaagata aaggcatccc 5520 
catactttgt gaacagaaag tgatagcgtt 5580 
acggtttctt ctattttgtc tctatatact 5640 
gttttcgatt cactctatga atagttctta 5700 
agagataaac ataaaaaatg tagaggtcga 5760 
ggatgggtag gttatatagg gatatagcac 5820 
caatgtttgt ggaagcggta ttcgcaatat 5880 
ttggtttttt gaaagtgcgt cttcagagcg 5940 
ctatactttc tagagaatag gaacttcgga 6000 
gcgcttccga aaatgcaacg cgagctgcgc 6060 
tatctgcgtg ttgcctgtat atatatatac 6120 
cttaaatgcg tacttatatg cgtctattta 6180 
gtgatattat cccattccat gcggggtatc 6240 
gttctatatg ctgccactcc tcaattggat 6300 
ttgatattgg atcatattaa gaaaccatta 6360 
gtatcacgag gccctttcgt ctcgcgcgtt 6420 
tgcagctccc ggagacggtc acagcttgtc 6480 
gtcagggcgc gtcagcgggt gttggcgggt 6540 
agcagattgt actgagagtg caccatagat 6600 
agcatttaat agacagcatc gtaatatatg 6660 
gtagtggaag atattcttta ttgaaaaata 6720 
gagcttttct ttttttgccg attaagaatt 6780 
caagagggag ggcattggtg actattgagc 6840 
gcagcttgga gtatgtctgt tattaatttc 6900 
tgcggcttgc agagcacaga ggccgcagaa 6960 
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tgtgctctag 
acaattgacc 
tcaggcactc 
gctctggtca 
gaataccaag 
aacatactac 
gaagcaggtg 
caagagagcc 
ggtgatgcgc 
aatggtgtaa 
attactgagt 
ataccgca ca 
tgttaaaatt 
tcggcaaaat 
tttggaacaa 
tctatcaggg 
ggtgccgtaa 
gaaagccggc 
cgctggcaag 
cgctacaggg 
cggtgcgggc 
taagttgggt 
caagctttcg 
accttctcaa 
aaaagaaaaa 
atagggacct 
ggagggcgtg 
cctgtttact 
ttcaacccac 
tttttttttt 
ttcatagaaa 
ttggaaaata 
agcagctctg 
aacatttgga 
aactggagca 
caatgtccac 
gaagtatctc 
tgtggtgatg 
tacgaccttt 
gcattcttga 
aaaaaaatca 
tgaacctatc 
gtctctgcgg 
ataattttga 
actcttgttt 
gtcaccgatg 
caagagatac 



attccgatgc 
cggttattgc 
cgaaatactt 
atgattacgg 
agttcctcgg 
tcagtgcagc 
ggacaggtga 
ccgaaagctt 
ttagattaaa 
aagactctaa 
agtatttatt 
gatgcgtaag 
cgcgttaaat 
cccttataaa 
gagtccacta 
cgatggccca 
agcactaaat 
gaacgtggcg 
tgtagcggtc 
cgcgtcgcgc 
ctcttcgcta 
aacgccaggg 
cgagctcgag 
gcaaggtttt 
tttgaaatat 
agacttcagg 
aatgtaagcg 
cacaggcttt 
caaaggccat 
tttttttttt 
taatacagaa 
catagagctt 
attttttctt 
attctaccct 
gtttccttag 
aatttgtcca 
ataccaancc 
ttgaccaccg 
accggctgag 
ttagttggat 
acggagaaag 
tgggaaaata 
gctattcacg 
ttttggtaat 
ttgctgtgtt 
ctaagttatc 
aggattggca 



tgacttgctg 
aaggaaaatt 
ggttggcgtg 
cattgatatc 
tttgccagtt 
ttcacagaaa 
acttttggat 
acattttatg 
tggcgttatt 
caaaatagca 
taagtattgt 
gagaaaatac 
ttttgttaaa 
tcaaaagaat 
ttaaagaacg 
ctacgtgaac 
cggaacccta 
agaaaggaag 
acgctgcgcg 
cattcgccat 
ttacgccagc 
ttttcccagt 
atcccgagct 
cagtataatg 
aaataacgtt 
ttgtctaact 
tgacataact 
tttcaagtag 
cttggtactt 
tttttttttt 
gtagatgttg 
tttgttgatg 
cagccaactt 
tacccaagat 
aagcagattt 
agttcaagac 
ttaccgaaat 
gccatacctc 
acgtgacctc 
gattgttctg 
caaacgccat 
cgcattaaac 
cgccagagga 
gtgtgggtcc 
tttcgatgaa 
tctatgtaag 
actgcaaat 
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ggtattatat 
tcaagtcttg 
tttcgtaatc 
gtccaactgc 
attaaaagac 
cctcattcgt 
tggaactcga 
ttagctggtg 
ggtgttgatg 
aatttcgtca 
ttgtgcactt 
cgcatcagga 
tcagctcatt 
agaccgagat 
tggactccaa 
catcacccta 
aagggagccc 
ggaagaaagc 
taaccaccac 
tcaggctgcg 
tggcgaaagg 
cacgacgttg 
ttgcaaatta 
ttacatgcgt 
cttaatacta 
ccttcctttt 
aattacatga 
gtaattaagt 
tttttttttt 
tttttttttt 
aattagatta 
cgcttaagcg 
ggagacgaat 
cttaccgtaa 
caagtattgg 
tggcttccag 
aacctggatg 
taccaccggg 
tgtgctttct 
ggatttaatg 
cttaaatata 
aagcgaaaaa 
aaataggaaa 
ctggtgtaca 
tctccaaaat 
ctacgtggcg 



gtgtgcccaa 
taaaagcata 
aacctaagga 
atggagatga 
tcgtatttcc 
ttattccctt 
tttctgactg 
gactgacgcc 
taagcggagg 
aaaatgctaa 
gccgatctat 
aattgtaaac 
ttttaaccaa 
agggttgagt 
cgtcaaaggg 
atcaagtttt 
ccgatttaga 
gaaaggagcg 
acccgccgcg 
caactgttgg 
gggatgtgct 
taaaacgacg 
aagccttcga 
acacgcgtct 
acataactat 
cggttagagc 
tatcgacaaa 
cgtttctgtc 
tttttttttt 
tttttttttt 
aactgaagat 
atcaattcaa 
ctagctttga 
ccggctgcca 
tctctcttgt 
aaatgagctt 
gtatttatcc 
gtgctttctg 
agtcttagtg 
caaaaaaatc 
cgggatacag 
ctgcgaggaa 
aataacaggg 
gatgttacat 
ggttgttagc 
tgacttttga 



tagaaagaga 
taaaaatagt 
ggatgttttg 
gtcgtggcaa 
aaaagactgc 
gtttgattca 
ggttggaagg 
agaaaatgtt 
tgtggagaca 
gaaataggtt 
gcggtgtgaa 
gttaatattt 
taggccgaaa 
gttgttccag 
cgaaaaaccg 
ttggggtcga 
gcttgacggg 
ggcgctaggg 
cttaatgcgc 
gaagggcgat 
gcaaggcgat 
gccagtcgtc 
gcgtcccaaa 
gtacagaaaa 
aaaaaaataa 
ggatgtgggg 
ggaaaagggg 
tttttccttc 
tttttttttt 
tttttttttt 
atataattta 
caacaccacc 
cgataactgg 
aagtgtcaat 
cttctgggat 
gttgcttgtg 
atgttaattc 
tgcttaccga 
aatctggaag 
actaagaagg 
atgaaaggtt 
aattgtttgc 
cattagaaaa 
tggttacagt 
acatggaaga 
tgaagccgca 



702G 
706C 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9749 



<210> 93 
<211> 1415 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 93 

Met lie Leu Arg His Phe Gly Phe lie Leu lie lie Leu Thr Val Tyr 
1 5 10 - 15 



Leu Ser Ser Thr His Ala Thr Thr Arg Lys His His Arg Arg Ser lie 
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20 25 30 

Asp Asn Asp Ala Arg Asn Leu Gly Phe Gin Phe Val Met Glu Pro Arg 
35 40 45 

Arg Asn Val Thr Val Met Glu Ser Ser Ser His Leu Leu Glu Cvs Spt 
50 55 60 

Tyr Val Leu Ala His Glu Arg Leu Val His Asp Val Arg He Glu Trp 
65 70 75 80 

Lys Arg Asp Gly Val Leu Leu Ser Glu Arg Thr Ser Ser Arg He Lys 

85 90 95 

Val Met Ser Asn Gly Ser Leu Trp He Glu Ser Val Ser Ser Ala Glu 
100 105 * 110 

Glu Gly Thr Tyr Gin Cys Ala Val His Val Thr Thr Lys Ser Asp Gin 
115 120 125 

Thr Ser Asp Thr Trp Thr Phe Leu Ser Arg Lys Ala Thr Leu Arg Leu 
130 135 140 

Ala Asp Leu Ala Lys Phe Glu Leu Gin Ala He Asp Arg Thr Leu Ala 
145 150 155 160 

Lys Gly Gin Pro Thr Ala Phe His Cys Leu lie Asn Ser Lys Pro Thr 
165 170 175 

Pro Thr Ala Val Trp Leu His Asn Asp Glu Pro He Val Asn Gly Gly 
180 185 190 

Glu Tyr His He Leu Pro Val Ser Asn Thr Leu Glu He Ser Ser Thr 
195 200 205 

Gin Ser Arg His Glu Gly Thr Tyr Arg Cys Thr Val Glu Gly Ala Gly 
210 215 220 

Lys Arg Arg Ser Ser Gin Thr Ala Arg Leu Thr Val Thr Thr Glu Thr 
225 230 235 240 

Val Ser Asn Glu Leu Val Phe He Thr Thr Pro Arg Leu Gin Val Val 
245 250 255 

Glu Gin Gly Asp Glu Phe Leu Leu Glu Cys Leu Val Ala Ser Leu He 
260 265 270 

Arg Pro Gin Val Arg Trp Leu Lys Asp Ser Arg Gin He He Val Asp 
275 280 285 

Gly Val Arg He Arg Arg Val Gly Val Ser Ser lie Leu Val Ser Arg 
290 295 300 

Ala Ser lie Glu Asp Thr Gly Leu Tyr Thr Cys Arg Ala Ser Asn Asn 
305 310 315 320 

Asp Asp Ser lie Asp Arg Ala Val Ser Val Glu Val Arg Ala Pro Pro 
325 330 335 
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Arg He Thr Thr Arg Pro Thr Thr Lys Val Ala Val Glu Thr Ala Asp 
340 345 350 

Val Glu Leu Glu Cys Gly Thr Ala Ala Ala Arg Pro Glu Ala Arg Val 
355 360 365 

Asn Trp Tyr Lys Asn Gly Glu Ala He He Gly Ser Glu Tyr Phe Val 
370 375 380 

He Glu Pro Asn Arg Leu Arg lie Leu Gly Val Val Arg Ala Asp Gin 
385 390 395 400 

Ala He Tyr Gin Cys He Ala Glu Asn Asp Val Gly Ser Glu Gin Ala 
405 410 415 

Ser Ala Gin Leu Leu Val Asp Ala Pro Asp Ser Ser Ser Val Ala Ala 
420 425 430 

Ser Ser Gly Val Pro Met Thr Ser Ser Ala Pro Leu Gly Leu Arg Ser 
435 440 445 

Thr Ser Ser Gly Ser Arg Phe lie Asn Val Glu Trp Asp Pro Pro Val 
450 455 460 

Gin Arg Asn Gly Asn lie Met Arg Tyr His lie Phe Tyr Lys Asp Asn 
465 470 475 480 

Leu He Asp Arg Glu Arg Met lie Asn Ser Ser Ser Thr Ser Ala Thr 
485 490 495 

Leu Thr Ser Leu Gin Pro Ser Thr Met Tyr Leu lie Arg Val Thr Ala 
500 505 510 

Glu Asn Glu Ala Gly Met Gly Lys Phe Ser Asp Ser Leu Lys Val Thr 
515 520 525 

Thr Asn Lys Glu Gin Ala Val Pro Gly Lys Val Ala Ser Leu Thr Thr 
530 535 540 

Thr Ala Thr Gly Pro Glu Thr lie Asp lie Arg Trp Ser Pro Pro Ser 
545 550 555 560 

Gly Gly Gin Pro Ala Leu Arg Tyr Lys He Phe Tyr Ser His Asp Pro 
565 570 575 

Leu Glu Lys Asn Glu Lys Glu Thr Leu lie Thr Thr Ser Thr Thr His 
580 "* 585 590 

Tyr Thr Leu His Gly Met Asp Lys Tyr Thr Gly Tyr Gin lie Arg lie 
595 600 605 

Glu Ala Glu Gly Ser Asn Gly Ser Gly Leu Ser Ser Asp Thr Val Lys 
610 615 620 

Val Arg Thr Gin Ser Asp Glu Pro Ser Ala Pro Pro Val Asn He Gin 
625 630 635 640 
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Ala Glu Ala Asp Ser Ser Thr Ser Val Arg Val Ser Trp Asp Glu Pro 
645 650 * 655 

Glu Glu Glu Ser Val Asn Gly Glu lie Thr Gly Tyr Arg Leu Lys Tyr 
660 665 670 

Lys Thr Lys Ala Arg Gly Ala Lys Gly Asn Thr Leu Val He Asp Ala 
675 680 685 

Thr Ala Arg Glu Tyr Thr Met Gly Asn Leu Glu Pro Asn Thr Gin Tyr 
690 695 700 

Leu He Arg Met Ala Val Val Asn His Asn Gly Thr Gly Pro Phe Ser 
705 710 715 720 

Asp Trp Val Ser He Asp Thr Pro Gly Gin Asp Lys Glu Glu Arg Thr 
725 730 735 

Leu Gly Ala Pro Arg Glu He Arg Pro His Ala Gly He Asp Tyr He 
740 745 750 

Leu Val Ser Trp Leu Pro Pro Ala Asp Glu Gin Asn Leu Val Arg Gly 
755 7 60 7 65 

Tyr Gin He Gly Trp Gly Leu Ser Val Pro Asp Thr Glu Thr He Arg 
770 775 780 

Val Thr Ala Ser Thr Thr Gin Tyr Lys lie Ala Arg Leu His Ser Glu 
785 790 795 800 

Arg Asp Tyr Val He Ser Leu Arg Ala Phe Asn Asn Leu Gly Ser Gly 
805 810 " 815 

Phe Pro He Tyr Glu Thr Val Arg Thr Leu Ser Arg Glu Thr Pro Ser 
820 825 ~ 830 

His Phe Asn Glu Asp Ser Asp Ser Asp Asp Ser Asp Val Gly Ser Ser 
835 840 * 845 

Glu Ser Thr Pro Val Gly Val Arg Ala Glu Ala lie Ser Ala Thr Ser 
850 855 860 

lie Arg Val Met Trp Thr Glu Ser Asp Glu Thr Ala Phe Asn Thr Gin 
865 870 875 880 

Tyr Thr Val Arg Tyr Ser Thr Ala Val Asp Gly Asn Gin His Arg Tyr 
885 890 895 

Val Asn Ser Thr Glu Thr Trp Ala Thr Val Glu Gly Leu Arg Pro Ala 
900 905 910 

Thr Glu Tyr Glu Phe Ala Val Arg Ala Val Ala Ser Asn Gly Gin Leu 
915 920 925 

Ser Thr Trp Ser Met Ala Thr Arg Asn Arg Thr Leu Ala Ala Pro Pro 
930 935 940 

Ser Ser Ala Pro Arg Asp Leu Thr Val Leu Pro Ala Glu Ser Gly Asp 
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Leu Arg Gly Thr Pro Pro Asn Ser Ser Ala Ala Asn Ala Leu Arg Ser 
1265 " 1270 1275 1280 

Phe Thr Gin Leu Ala Gly Ala Thr Pro Pro Pro Pro His Ser Ala Ala 
1285 "* 1290 1295 

Ser Ser Ser Ser Arg Pro Thr lie lie Ala Ala Gly Gly Arg Gin Val 
1300 " 1305 1310 

Pro Val Gly Arg Ala Thr Ala Gin Pro Arg Val Asn Val Ala Asn lie 
1315 1320 1325 

Tyr Ser Pro Phe Ala Ser Cys Ser Ala Ser Ser Asp Ala Gly Glu Ser 
1330 1335 1340 

Asp Lys Lys Ser Gly Glu Cys Met Glu Met Arg Glu Thr Thr Pro lie 
1345 1350 1355 1360 

Lys Ser Asn Thr Ala Gly Ser Ser Asn Gly Glu Lys Met Asn Thr Asn 
1365 1370 1375 

Met Asn Pro Ser His Ser Ala Glu Asp Leu Asn Ala His Leu Glu Asn 
1380 1385 1390 

Leu Asp Thr Met Leu Asp Asp Leu Gin Gin Leu Gin His Asn Leu His 
1395 1400 1405 

Phe Glu Thr Ser Met Asp Lys 
1410 1415 



<210> 94 
<211> 4248 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 94 

atgattttgc 

catgccacaa 

tttcaattcg 

ttagaatgta 

aaacgtgatg 

ggttcattat 

catgtgacta 

acgttacgat 

aaagggcagc 

tggcttcaca 

aacactttgg 

gaaggtgctg 

gtatcgaatg 

gaattcttac 

gattctcgtc 

cttgtctcac 

gatgattcaa 

cgacctacca 

gctgctagac 

gagtattttg 



gacatttcgg 
cacggaaaca 
taatggaacc 
gttatgttct 
gtgtgttgct 
ggatagaatc 
caaaaagcga 
tggcggattt 
caactgcgtt 
acgatgaacc 
agattagttc 
ggaagagaag 
agctagtttt 
ttgaatgcct 
aaattattgt 
gtgcttctat 
ttgatcgagc 
cgaaagttgc 
cagaggcacg 
taattgagcc 



tttcattctt 
tcatcgtcga 
acgacggaat 
ggctcatgag 
cagtgaacgg 
agtgagcagt 
tcaaacaagt 
ggcaaagttt 
tcattgtctt 
tattgtaaat 
cactcaatct 
aagtagccaa 
catcacaacg 
tgttgcctca 
cgatggagtt 
cgaagatact 
tgtatctgta 
tgttgaaact 
cgttaattgg 
gaacagactc 



ataattctga 
tcgattgata 
gtgactgtta 
agacttgtac 
acatcttcta 
gccgaagaag 
gatacgtgga 
gaactgcaag 
attaactcga 
ggtggcgaat 
agacacgaag 
actgcacgat 
ccacgtctac 
ctaatcagac 
cgtattcgac 
gggctataca 
gaagttcgtg 
gctgatgttg 
tacaaaaatg 
cgcatccttg 



cagtttatct 
atgatgctcg 
tggaatcatc 
acgatgtccg 
gaataaaagt 
gaacatatca 
catttttgag 
cgattgatcg 
aaccaacccc 
atcacattct 
ggacgtatag 
tgaccgttac 
aagttgttga 
ctcaagttag 
gtgttggcgt 
cttgtcgagc 
ctccgccgag 
aattagagtg 
gagaagcaat 
gagtcgtacg 



gtcatctact 
gaatcttggc 
gtctcattta 
aattgaatgg 
aatgtcaaat 
atgcgctgta 
tcgaaaagct 
aactctagca 
aacagcagtt 
accagtgtca 
gtgtacggta 
tactgaaaca 
acaaggcgac 
atggctcaaa 
atcgtcaatt 
ttcgaataat 
aatcactaca 
tggaacggcg 
aattgggagc 
agccgatcaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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945 950 955 960 

Pro His Ser Ser Ser Leu His Trp Gin Pro Pro Lys Tyr Ser Asn Gly 
965 970 975 

Glu He Glu Glu Tyr Leu Val Phe Tyr Thr Asp Arg Ala Ser Leu Ala 

o r> rv no C OOA 

Z/ \J \J S <J ~* -* ^ 

Asp Lys Asp Trp Thr He Asn Tyr Val Ala Gly Asp Lys Leu Ser His 
995 1000 1005 

Gin Val Ser Asn Leu Leu Pro Lys Ala Asn Tyr Phe Phe Lys He Gin 
1010 1015 1020 

Ala Arg Asn Glu Lys Gly His Gly Pro Phe Ser Ser Val Val Gly Tyr 
1025 1030 1035 1040 

Thr Pro Ser Gly Gly Ala He Leu Ser Gly Lys Asp Arg His Asn Ala 
1045 1050 1055 

Arg Gly His Gly Ser Ala Ala Ser Gly Asp Thr Val Ser Leu Val Asp 
1060 1065 1070 

Gin Leu Gin Ser Leu Leu His Ser Asn Pro Leu Tyr Leu He Leu Leu 
1075 1080 1085 

Ala Ala Phe Ala Leu He Leu He Leu Thr Leu He Leu He He Met 
1090 1095 1100 

Cys Cys Trp Lys Arg Ser Ser Gly Gly Gly Arg Lys Asn Gly Tyr Gin 
1105 ~ 1110 1115 1120 

Ser Gly Lys Lys Thr Ser Ala Gly Ala Gly Ser Gly Gly Gly He Gly 
1125 1130 1135 

Gly Leu Gly Gly Pro Pro Asn Asp Leu Trp He Asn Gly Thr Gly Ser 
1140 1145 1150 

His Met Arg Ala Gly Ala Ser Asp Tyr Met Val Asp Gly Leu Ala Thr 
1155 1160 1165 

Ala His Leu Thr Ala Ala Asp He Glu Ser Pro Thr Pro Arg Tyr His 
1170 1175 1180 

His Leu Gin Gly Gin Gly Thr Leu Thr Arg Ser Tyr His Gin Ser Ser 
1185 1190 1195 1200 

Gin Ser Leu Glu Gly Arg Gin Arg Thr Pro Gin Val Val Tyr Thr Gly 
1205 1210 1215 

Thr Gly Arg His Gin Pro He Gin Arg He Asp Phe Glu Ser Pro Tyr 
1220 1225 1230 

Gly Ser Ser Ser Ala He Gly Ser Ala Ser Thr Pro Pro Leu Pro Met 
1235 1240 1245 

Gin Ala Pro Pro Ser Gly Pro Pro Thr Val He Asp Gly Tyr Arg Thr 
1250 1255 1260 
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gcaatctacc aatgtattgc agaaaatgat 
cttgttgatg ctccagattc atcatcagtg 
tctgcaccgt taggcctccg ttcgacgtct 
gatccaccag ttcaaagaaa tgggaatatt 
ttaatcgaca gagaaaggat gatcaattca 
caaccatcaa caatgtattt aattcgagtt 
tttagtgatt cacttaaagt taccacaaac 
tcactaacaa caactgcaac tggccccgaa 
ggaggacaac ctgcccttcg atacaaaata 
gaaaaagaaa cattgattac gacctcgaca 
tacactggat atcagattcg aattgaagct 
gatactgtga aagtacgaac acaatccgat 
gcagaagctg atagctcaac aagtgttcga 
gttaacggag aaattactgg atatcggttg 
ggcaatactc ttgtgattga cgccactgct 
aatacacagt atctgataag aatggcagta 
gattgggttt ctattgatac tcctggacaa 
agagagattc gaccacatgc tggaattgat 
gatgaacaga atcttgttag aggatatcaa 
gaaacaattc gtgtaactgc ttccacaact 
cgagattatg tgatttctct tcgagcattt 
gagactgtcc gaacattgtc tcgtgaaact 
gacgattctg acgttggatc ctcagaatct 
tctgcaacat caatacgtgt tatgtggacg 
tatacagttc gatatagtac agctgttgat 
gagacatggg ctactgtgga aggactacga 
gctgttgcat caaatgggca gttgagtact 
gcggctccac caagttcagc tccacgtgat 
cctcactctt caagtcttca ttggcaacct 
tatttagtgt tttatacgga tcgagcttct 
gtagctggtg ataagttgtc acatcaagtc 
ttcaagattc aagcaagaaa tgagaaaggt 
actccatcag gcggtgctat tctatcagga 
tcagcggctt ctggcgatac agtttcctta 
aacccattgt atctcatact tttggctgcg 
cttatcataa tgtgttgttg gaaacgatct 
tctggaaaaa agacgagtgc cggagctgga 
ccacctaatg atttatggat taatggaact 
tatatggttg atggtctagc aaccgcccat 
cctagatacc atcatctaca agggcaagga 
caaagtcttg aaggacgaca acgaacacct 
cagccgattc aaagaatcga ctttgaaagt 
gcctcgactc cgcctctccc aatgcaagct 
ggataccgta ctctacgtgg aacacctcca 
ttcactcaat tggcaggagc aacaccacct 
cgaccgacta ttattgcagc cggtggacgg 
ccacgtgtca atgttgcaaa tatctacagt 
gcaggagaat cagataaaaa gagtggagaa 
aaatctaata cagctggttc atcgaatgga 
cactcggcag aagatctcaa cgcacatctt 
caacaattac agcacaattt gcattttgag 

<210> 95 
<211> 1447 
<212> PRT 

<213> Homo sapiens 
<400> 95 
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gttggaagtg aacaagctag tgcacaactt 1260 
gctgcttcct caggagttcc aatgacttcg 1320 
tctggttctc gctttataaa tgttgaatgg 1380 
atgagatatc atatattcta caaagataat 1440 
tcaagtacat ctgcaacgtt aacatctcta 1500 
actgctgaaa atgaagctgg aatgggaaag 1560 
aaagaacaag cagttcctgg aaaagtcgca 1620 
acgattgaca tccgttggag cccaccaagt 1680 
ttctattcgc atgatccatt ggagaaaaat 1740 
acacattata cacttcatgg aatggataaa 1800 
gaaggatcca atggatcggg gctttcaagt 1860 
gaaccatcag caccgcctgt aaacattcaa 1920 
gttagttggg atgagccaga agaggaatct 1980 
aagtacaaga caaaagctcg tggagccaaa 2040 
agagagtata ctatgggaaa tttggaaccg 2100 
gtgaatcaca atggaacggg acccttctcg 2160 
gataaagaag aacgaacact tggagcaccg 2220 
tatattcttg tatcatggct tccaccagca 2280 
attggatggg gattgagtgt tcctgatact 2340 
caatataaaa ttgccagatt gcattcggaa 2400 
aataatctag gatctggatt tccgatttat 2460 
ccatcacatt tcaatgaaga ttctgattct 2520 
actccagttg gagtgcgtgc tgaggcaatc 2580 
gaatctgatg aaactgcttt caatacacag 2640 
gggaatcaac atagatacgt gaatagcacc 2700 
ccagctactg agtatgagtt tgcggttaga 2760 
tggtcaatgg ctaccaggaa tcgaacatta 2820 
ctcactgtgc tgcctgcaga aagtggcgat 2880 
ccgaaatatt caaatggaga aattgaagaa 2940 
cttgcggaca aagattggac aattaactat 3000 
agcaatttac ttccaaaagc taattatttc 3060 
catggcccat tctcttctgt agttggatac 3120 
aaagatcgtc acaatgcacg tgggcatgga 3180 
gtcgatcaac ttcaatctct tcttcactct 3240 
tttgctctga ttctgatcct tacactgatt 3300 
agtggtggag ggcgaaagaa tggatatcag 3360 
agtggaggtg gaataggtgg tcttggaggt 3420 
ggaagtcata tgagagctgg tgcttctgat 3480 
cttactgctg ctgacattga atcacccaca 3540 
actctaacac gaagttatca tcaaagctca 3600 
caagtggttt atacaggaac tggaaggcat 3660 
ccttatggtt catcatctgc aattggatcc 3720 
ccaccttctg gtccacctac agtaatcgat 3780 
aattcatcag ctgcaaatgc tcttcggtca 3840 
ccaccacact ctgcggcatc atcatcttct 3900 
caagttccag ttggaagagc aactgctcaa 3960 
ccatttgcat catgctctgc atcttctgat 4020 
tgtatggaaa tgagagaaac cacacccatt 4080 
gagaagatga acacaaatat gaatccttca 4140 
gagaaccttg acacaatgct tgatgatcta 4200 
acgagtatgg ataagtaa 4248 
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Met Glu Asn Ser Leu Arg Cys Val Trp Val Pro Lys Leu Ala Phe Val 
15 10 15 

Leu Phe Gly Ala Ser Leu Leu Ser Ala His Leu Gin Val Thr Gly Phe 
20 25 30 

Gin lie Lys Ala Phe Thr Ala Leu Arg Phe Leu Ser Glu Pro Ser Asp 
35 40 45 

Ala Val Thr Met Arg Gly Gly Asn Val Leu Leu Asp Cys Ser Ala Glu 
50 55 60 

Ser Asp Arg Gly Val Pro Val lie Lys Trp Lys Lys Asp Gly lie His 
65 70 75 80 

Leu Ala Leu Gly Met Asp Glu Arg Lys Gin Gin Leu Ser Asn Gly Ser 
85 90 95 

Leu Leu He Gin Asn He Leu His Ser Arg His His Lys Pro Asp Glu 
100 105 110 

Gly Leu Tyr Gin Cys Glu Ala Ser Leu Gly Asp Ser Gly Ser He He 
115 120 125 

Ser Arg Thr Ala Lys Val Ala Val Ala Gly Pro Leu Arg Phe Leu Ser 
130 135 140 

Gin Thr Glu Ser Val Thr Ala Phe Met Gly Asp Thr Val Leu Leu Lys 
145 150 155 160 

Cys Glu Val He Gly Glu Pro Met Pro Thr lie His Trp Gin Lys Asn 
165 170 175 

Gin Gin Asp Leu Thr Pro He Pro Gly Asp Ser Arg Val Val Val Leu 
180 185 190 

Pro Ser Gly Ala Leu Gin lie Ser Arg Leu Gin Pro Gly Asp lie Gly 
195 200 205 

lie Tyr Arg Cys Ser Ala Arg Asn Pro Ala Ser Ser Arg Thr Gly Asn 
210 215 220 

Glu Ala Glu Val Arg lie Leu Ser Asp Pro Gly Leu His Arg Gin Leu 
225 230 235 240 

Tyr Phe Leu Gin Arg Pro Ser Asn Val Val Ala He Glu Gly Lys Asp 
245 250 255 

Ala Val Leu Glu Cys Cys Val Ser Gly Tyr Pro Pro Pro Ser Phe Thr 
260 265 270 

Trp Leu Arg Gly Glu Glu Val lie Gin Leu Arg Ser Lys Lys Tyr Ser 
275 280 285 

Leu Leu Gly Gly Ser Asn Leu Leu lie Ser Asn Val Thr Asp Asp Asp 
290 295 300 

Ser Gly Met Tyr Thr Cys Val Val Thr Tyr Lys Asn Glu Asn lie Ser 
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305 310 315 320 

Ala Ser Ala Glu Leu Thr Val Leu Val Pro Pro Trp Phe Leu Asn His 
325 330 335 

Pro Ser Asn Leu Tyr Ala Tyr Glu Ser Met Asp lie Glu Phe Glu Cys 
340 345 350 

Thr Val Ser Gly Lys Pro Val Pro Thr Val Asn Trp Met Lys Asn Gly 
355 360 365 

Asp Val Val lie Pro Ser Asp Tyr Phe Gin lie Val Gly Gly Ser Asn 
370 375 380 

Leu Arg lie Leu Gly Val Val Lys Ser Asp Glu Gly Phe Tyr Gin Cys 
385 390 395 400 

Val Ala Glu Asn Glu Ala Gly Asn Ala Gin Thr Ser Ala Gin Leu lie 
405 410 415 

Val Pro Lys Pro Ala lie Pro Ser Ser Ser Val Leu Pro Ser Ala Pro 
420 425 430 

Arg Asp Val Val Pro Val Leu Val Ser Ser Arg Phe Val Arg Leu Ser 
435 440 445 

Trp Arg Pro Pro Ala Glu Ala Lys Gly Asn lie Gin Thr Phe Thr Val 
450 455 460 

Phe Phe Ser Arg Glu Gly Asp Asn Arg Glu Arg Ala Leu Asn Thr Thr 
465 470 ~ 475 480 

Gin Pro Gly Ser Leu Gin Leu Thr Val Gly Asn Leu Lys Pro Glu Ala 
485 490 495 

Met Tyr Thr Phe Arg Val Val Ala Tyr Asn Glu Trp Gly Pro Gly Glu 
500 505 510 

Ser Ser Gin Pro lie Lys Val Ala Thr Gin Pro Glu Leu Gin Val Pro 
515 520 525 

Gly Pro Val Glu Asn Leu Gin Ala Val Ser Thr Ser Pro Thr Ser lie 
530 535 540 

Leu lie Thr Trp Glu Pro Pro Ala Tyr Ala Asn Gly Pro Val Gin Gly 
545 550 555 560 

Tyr Arg Leu Phe Cys Thr Glu Val Ser Thr Gly Lys Glu Gin Asn lie 
565 570 575 

Glu Val Asp Gly Leu Ser Tyr Lys Leu Glu Gly Leu Lys Lys Phe Thr 
580 585 590 

Glu Tyr Ser Leu Arg Phe Leu Ala Tyr Asn Arg Tyr Gly Pro Gly Val 
595 ~ 600 605 

Ser Thr Asp Asp lie Thr Val Val Thr Leu Ser Asp Val Pro Ser Ala 
610 615 620 
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Pro Pro Gin Asn Val Ser Leu Glu Val Val Asn Ser Arg Ser lie Lys 
625 630 635 640 

Val Ser Trp Leu Pro Pro Pro Ser Gly Thr Gin Asn Gly Phe lie Thr 
645 650 655 

Gly Tyr Lys lie Arg His Arg Lys Thr Thr Arg Arg Gly Glu Met Glu 
660 " J 665 670 

Thr Leu Glu Pro Asn Asn Leu Trp Tyr Leu Phe Thr Gly Leu Glu Lys 
675 680 685 

Gly Ser Gin Tyr Ser Phe Gin Val Ser Ala Met Thr Val Asn Gly Thr 
690 695 700 

Gly Pro Pro Ser Asn Trp Tyr Thr Ala Glu Thr Pro Glu Asn Asp Leu 
705 710 715 720 

Asp Glu Ser Gin Val Pro Asp Gin Pro Ser Ser Leu His Val Arg Pro 
725 730 735 

Gin Thr Asn Cys lie lie Met Ser Trp Thr Pro Pro Leu Asn Pro Asn 
740 745 750 

He Val Val Arg Gly Tyr He He Gly Tyr Gly Val Gly Ser Pro Tyr 
755 760 765 

Ala Glu Thr Val Arg Val Asp Ser Lys Gin Arg Tyr Tyr Ser He Glu 
770 775 780 

Arg Leu Glu Ser Ser Ser His Tyr Val lie Ser Leu Lys Ala Phe Asn 
785 790 795 800 

Asn Ala Gly Glu Gly Val Pro Leu Tyr Glu Ser Ala Thr Thr Arg Ser 
805 810 815 

He Thr Asp Pro Thr Asp Pro Val Asp Tyr Tyr Pro Leu Leu Asp Asp 
820 825 830 

Phe Pro Thr Ser Val Pro Asp Leu Ser Thr Pro Met Leu Pro Pro Val 
835 840 845 

Gly Val Gin Ala Val Ala Leu Thr His Asp Ala Val Arg Val Ser Trp 
850 855 860 

Ala Asp Asn Ser Val Pro Lys Asn Gin Lys Thr Ser Glu Val Arg Leu 
865 870 875 880 

Tyr Thr Val Arg Trp Arg Thr Ser Phe Ser Ala Ser Ala Lys Tyr Lys 
885 ^ 890 895 

Ser Glu Asp Thr Thr Ser Leu Ser Tyr Thr Ala Thr Gly Leu Lys Pro 
900 905 910 

Asn Thr Met Tyr Glu Phe Ser Val Met Val Thr Lys Asn Arg Arg Ser 
915 920 925 
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Ser Thr Trp Ser Met Thr Ala His Ala Thr Thr Tyr Glu Ala Ala Pro 
930 " 935 940 

Thr Ser Ala Pro Lys Asp Phe Thr Val lie Thr Arg Glu Gly Lys Pro 
945 950 955 960 

Arg Ala Val lie Val Ser Trp Gin Pro Pro Leu Glu Ala Asn Gly Lys 
965 970 975 

lie Thr Ala Tyr lie Leu Phe Tyr Thr Leu Asp Lys Asn lie Pro lie 
980 985 990 

Asp Asp Trp He Met Glu Thr He Ser Gly Asp Arg Leu Thr His Gin 
995 1000 1005 

He Met Asp Leu Asn Leu Asp Thr Met Tyr Tyr Phe Arg He Gin Ala 
1010 1015 1020 

Arg Asn Ser Lys Gly Val Gly Pro Leu Ser Asp Pro He Leu Phe Arg 
1025 1030 1035 1040 

Thr Leu Lys Val Glu His Pro Asp Lys Met Ala Asn Asp Gin Gly Arg 
1045 1050 1055 

His Gly Asp Gly Gly Tyr Trp Pro Val Asp Thr Asn Leu He Asp Arg 
1060 1065 1070 

Ser Thr Leu Asn Glu Pro Pro lie Gly Gin Met His Pro Pro His Gly 
1075 1080 1085 

Ser Val Thr Pro Gin Lys Asn Ser Asn Leu Leu Val lie lie Val Val 
1090 1095 1100 

Thr Val Gly Val lie Thr Val Leu Val Val Val lie Val Ala Val lie 
1105 1110 1H5 1120 

Cys Thr Arg Arg Ser Ser Ala Gin Gin Arg Lys Lys Arg Ala Thr His 
1125 1130 1135 

Ser Ala Gly Lys Arg Lys Gly Ser Gin Lys Asp Leu Arg Pro Pro Asp 
1140 " 1145 H50 

Leu Trp lie His His Glu Glu Met Glu Met Lys Asn lie Glu Lys Pro 
1155 1160 1165 

Ser Gly Thr Asp Pro Ala Gly Arg Asp Ser Pro lie Gin Ser Cys Gin 
1170 1175 1180 

Asp Leu Thr Pro Val Ser His Ser Gin Ser Glu Thr Gin Leu Gly Ser 
1185 1190 1195 1200 

Lys Ser Thr Ser His Ser Gly Gin Asp Thr Glu Glu Ala Gly Ser Ser 
1205 1210 1215 

Met Ser Thr Leu Glu Arg Ser Leu Ala Ala Arg Arg Ala Pro Arg Ala 
1220 1225 1230 

Lys Leu Met lie Pro Met Asp Ala Gin Ser Asn Asn Pro Ala Val Val 
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1235 1240 1245 

Ser Ala He Pro Val Pro Thr Leu Glu Ser Ala Gin Tyr Pro Gly He 
1250 1255 1260 

Leu Pro Ser Pro Thr Cys Gly Tyr Pro His Pro Gin Phe Thr Leu Arg 
1265 1270 1275 1280 

Pro Val Pro Phe Pro Thr Leu Ser Val Asp Arg Gly Phe Gly Ala Gly 
1285 1290 1295 

Arg Ser Gin Ser Val Ser Glu Gly Pro Thr Thr Gin Gin Pro Pro Met 
1300 1305 1310 

Leu Pro Pro Ser Gin Pro Glu His Ser Ser Ser Glu Glu Ala Pro Ser 
1315 1320 1325 

Arg Thr He Pro Thr Ala Cys Val Arg Pro Thr His Pro Leu Arg Ser 
1330 1335 1340 

Phe Ala Asn Pro Leu Leu Pro Pro Pro Met Ser Ala He Glu Pro Lys 
1345 1350 1355 1360 

Val Pro Tyr Thr Pro Leu Leu Ser Gin Pro Gly Pro Thr Leu Pro Lys 
1365 1370 1375 

Thr His Val Lys Thr Ala Ser Leu Gly Leu Ala Gly Lys Ala Arg Ser 
1380 1385 " 1390 

Pro Leu Leu Pro Val Ser Val Pro Thr Ala Pro Glu Val Ser Glu Glu 
1395 1400 1405 

Ser His Lys Pro Thr Glu Asp Ser Ala Asn Val Tyr Glu Gin Asp Asp 
1410 1415 1420 

Leu Ser Glu Gin Met Ala Ser Leu Glu Gly Leu Met Lys Gin Leu Asn 
1425 1430 1435 1440 

Ala He Thr Gly Ser Ala Phe 
1445 



<210> 96 
<211> 4344 
<212> DNA 

<213> Homo sapiens 
<400> 96 

atggagaata gtcttagatg tgtttgggta cccaagctgg cttttgtact cttcggagct 60 

tccttgctca gcgcgcatct tcaagtaacc ggttttcaaa ttaaagcttt cacagcactg 120 

cgcttcctct cagaaccttc tgatgccgtc acaatgcggg gaggaaatgt cctcctcgac 180 

tgctccgcgg agtccgaccg aggagttcca gtgatcaagt ggaagaaaga tggcattcat 240 

ctggccttgg gaatggatga aaggaagcag caactttcaa atgggtctct gctgatacaa 300 

aacatacttc attccagaca ccacaagcca gatgagggac tttaccaatg tgaggcatct 360 

ttaggagatt ctggctcaat tattagtcgg acagcaaaag ttgcagtagc aggaccactg 420 

aggttccttt cacagacaga atctgtcaca gccttcatgg gagacacagt gctactcaag 4 80 

tgtgaagtca ttggggagcc catgccaaca atccactggc agaagaacca acaagacctg 540 

actccaatcc caggtgactc ccgagtggtg gtcttgccct ctggagcatt gcagatcagc 600 
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cgactccaac cgggggacat tggaatttac 
agaacaggaa atgaagcaga agtcagaatt 
tattttctgc aaagaccatc caatgtagta 
tgttgtgttt ctggctatcc tccaccaagt 
caactcaggt ctaaaaagta ttctttattg 
acagatgatg acagtggaat gtatacctgt 
gcctctgcag agctcacagt cttggttccg 
tatgcctatg aaagcatgga tattgagttt 
actgtgaatt ggatgaagaa tggagatgtg 
ggaggaagca acttacggat acttggggtg 
gtggctgaaa atgaggctgg aaatgcccag 
gcaatcccaa gctccagtgt cctcccttcg 
tccagccgat ttgtccgtct cagctggcgc 
actttcacgg tctttttctc cagagaaggt 
cagcctgggt cccttcagct cactgtggga 
cgagttgtgg cttacaatga atggggaccg 
acacagcctg agttgcaagt tccagggcca 
cctacctcaa ttcttattac ctgggaaccc 
tacagattgt tctgcactga ggtgtccaca 
ctatcttata aactggaagg cctgaaaaaa 
tataatcgct atggtccggg cgtctctact 
gtgccaagtg ccccgcctca gaacgtctcc 
gttagctggc tgcctcctcc atcaggaaca 
cgacacagaa agacgacccg caggggtgag 
tacctattca caggactgga gaaaggaagt 
gtcaatggta ctggaccacc ttccaactgg 
gatgaatctc aagttcctga tcaaccaagc 
atcatcatga gttggactcc tcccttgaac 
ggttatggcg ttgggagccc ttacgctgag 
tattccattg agaggttaga gtcaagttcc 
aatgccggag aaggagttcc tctttatgaa 
actgacccag ttgattatta tcctttgctt 
tccaccccca tgctcccacc agtaggtgta 
agggtcagct gggcagacaa ctctgtccct 
tacaccgtcc ggtggagaac cagcttttct 
acatctctaa gttacacagc aacaggcctc 
atggtaacaa aaaacagaag gtccagtact 
gaagcagccc ccacctctgc tcccaaggac 
cgtgccgtca ttgtgagttg gcagcctccc 
atcttatttt ataccttgga caagaacatc 
agtggtgata ggcttactca tcaaatcatg 
cgaattcaag cacgaaattc aaaaggagtg 
actctgaaag tggaacaccc tgacaaaatg 
ggttattggc cagttgatac taatttgatt 
ggacaaatgc accccccgca tggcagtgtc 
atcattgtgg tcaccgttgg tgtcatcaca 
tgcacccgac gctcttcagc ccagcagaga 
aggaagggca gccagaagga cctccgaccc 
gagatgaaaa atattgaaaa gccatctggc 
caaagttgcc aagacctcac accagtcagc 
aaaagcacct ctcattcagg tcaagacact 
gagaggtcgc tggctgcacg ccgagccccc 
cagtccaaca atcctgctgt cgtgagcgcc 
tacccaggaa tcctcccgtc tcccacctgt 
cctgtgccat tcccaacact ctcagtggac 
gtgagtgaag gaccaactac ccaacaacca 
tctagcagcg aggaggcacc aagcagaacc 
ccactccgca gctttgctaa tcctttgcta 
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cgatgctcag ctcgaaatcc agccagctca 660 
ttatcagatc caggactgca tagacagctg 72 0 
gccattgaag gaaaagatgc tgtcctggaa 78 0 
tttacctggt tacgaggcga ggaagtcatc 840 
ggtggaagca acttgcttat ctccaatgtg 900 
gttgtcacat ataaaaatga gaatattagt 960 
ccatggtttt taaatcatcc ttccaacctg 1020 
gaatgtacag tctctggaaa gcctgtgccc 1080 
gtcattccta gtgattattt tcagatagtg 1140 
gtgaagtcag atgaaggctt ttatcaatgt 1200 
accagtgcac agctcattgt ccctaagcct 1260 
gctcccagag atgtggtccc tgtcttggtt 1320 
ccacctgcag aagcgaaagg gaacattcaa 1380 
gacaacaggg aacgagcatt gaatacaaca 1440 
aacctgaagc cagaagccat gtacaccttt 1500 
ggagagagtt ctcaacccat caaggtggcc 1560 
gtagaaaacc tgcaagctgt atctacctca 1620 
cctgcctatg caaacggtcc agtccaaggt 1680 
ggaaaagaac agaatataga ggttgatgga 1740 
ttcaccgaat atagtcttcg attcttagct 1800 
gatgatataa cagtggttac actttctgac 18 60 
ctggaagtgg tcaattcaag aagtatcaaa 1920 
caaaatggat ttattaccgg ctataaaatt 1980 
atggaaacac tggagccaaa caacctctgg 2040 
cagtacagtt tccaggtgtc agccatgaca 2100 
tatactgcag agactccaga gaatgatcta 2160 
tctcttcatg tgaggcccca gactaactgc 2220 
ccaaacatcg tggtgcgagg ttatattatc 2280 
acagtgcgtg tggacagcaa gcagcgatat 2340 
cattatgtaa tctccctaaa agcttttaac 2400 
agtgccacca ccaggtctat aaccgatccc 24 60 
gatgatttcc ccacctcggt cccagatctc 2520 
caggctgtgg ctcttaccca tgatgctgtg 2580 
aagaaccaaa agacgtctga ggtgcgactt 2640 
gcaagtgcaa aatacaagtc agaagacaca 2700 
aaaccaaaca caatgtatga attctcggtc 2760 
tggagcatga ctgcacatgc caccacgtat 2820 
tttacagtca ttactaggga agggaagcct 2880 
ttggaagcca atgggaaaat tactgcttac 2940 
ccaattgatg actggattat ggaaacaatc 3000 
gatctcaacc ttgatactat gtattacttt 3060 
gggccactct ctgatcccat cctcttcagg 3120 
gctaatgacc aaggtcgtca tggagatgga 3180 
gatagaagca ccctaaatga gccgc.caatt 3240 
actcctcaga agaacagcaa cctgcttgtg 3300 
gtgctggtag tggtcatcgt ggctgtgatt 3360 
aagaaacggg ccacccacag tgctggcaaa 3420 
cctgatcttt ggatccatca tgaagaaatg 3480 
actgaccctg caggaaggga ctctcccatc 3540 
cacagccagt cagaaaccca actgggaagc 3600 
gaggaagcag ggagctctat gtccactctg 3660 
cgggccaagc tcatgattcc catggatgcc 3720 
atcccggtgc caacgctaga aagtgcccag 3780 
ggatatcccc acccgcagtt cactctccgg 3840 
cgaggtttcg gagcaggaag aagtcagtca 3900 
cctatgctgc ccccatctca gcctgagcat 3960 
atccccacag cttgtgttcg accaactcac 4020 
cctccaccaa tgagtgcaat agaaccgaaa 4080 
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gtcccttaca caccactttt gtctcagcca 
acagcctccc ttgggttggc tggaaaagca 
acagcccctg aagtgtctga ggagagccac 
gaacaggatg atctgagtga acaaatggca 
gccatcacag gctcagcctt ttaa 
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gggcccactc ttcctaagac ccatgtgaaa 4140 
agatcccctt tgcttcctgt gtctgtgcca 4200 
aaaccaacag aggattcagc caatgtgtat 4260 
agtttggaag gactcatgaa gcagcttaat 4320 

4344 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-18 (totally) and 19-28,30-59,61-64,66-67, 
69 (all partially) 



1.1. Claims: 1-6 (totally) and 19-28,30-59,61-64,66-67, 
69 (all partially) 

A human Unc-5Cb protein (SEQ ID 2), nucleic acids 
encoding it (SEQ ID 1), vectors and cells expressing 
it and an antibody binding thereto. Methods for 
identifying compounds which are capable of modulating 
the binding of Unc-5Cb to an interacting protein. 



1.2. Claims: 7-12,71,85 (totally) and 19-28,30-59,61-64, 
66-67,69 (all partially) 

As for subject 1.1, but concerning a human Unc-5Cc 
protein (SEQ ID 4). 



1.3. Claims: 13-18 (totally) and 19-28,30-59,61-64,66-67, 
69 (all partially) 

As for subject 1.1, but concerning a human Unc-5C8 
protein (SEQ ID 6). 



2. Claims: 19,29-58 (all partially) 

A method, as characterised in claim 19, for identifying 
compounds capable of modulating the binding of an unc-5 
protein to an interacting protein. 



3. Claims: 20,29-58 (all partially) 

A method, as characterised in claim 20, for identifying 
compounds capable of modulating the binding of an unc-5 
protein to an interacting protein. 



4. Claims: 21,29-58 (all partially) 

A method, as characterised in claim 21, for identifying 
compounds capable of modulating the binding of an unc-5 
protein to an interacting protein. 



5. Claims: 22,29-58 (all partially) 
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A method, as characterised in claim 22, for identifying 
compounds capable of modulating the binding of an unc-5 
protein to an interacting protein. 



6. Claims: 23-58 (all partially) 

A method, as characterised in claim 23, for identifying 
compounds capable of modulating the binding of an unc-5 
protein to an interacting protein. 



7. Claims: 59,61-64,66-67,69 (all partially) and claims 60,65, 
68 (totally) 



A method for identifying compounds reducing or inhibiting 
the lethal phenotype associated with the expression of an 
UNC-5 death domain in yeast. 



8. Claims: 70,80-84 (totally) and 19-28,30-59,61-64,66-67, 
69 (all partially) 



A nucleic acid encoding the human unc-5Hl homolog (SEQ ID 
7), probes and anti sense nucleic acids hybridizing 
therewith, vectors and cells comprising it. Methods for 
identifying compounds which are capable of modulating the 
binding of Unc-5H1 to an interacting protein. 



9. Claims: 72-73 (totally) and 19-31,53 (all partially) 

A nucleic acid obtainable by digestion of pYMP17 with EcoRI 
and Xhol (SEQ ID 56). Methods for identifying compounds 
which are capable of modulating the binding of Unc-5 protein 
to the protein encoded by SEQ ID 56. 



10. Claims: 74-75 (totally) and 19-31,54 (all partially) 

A nucleic acid obtainable by digestion of pYMP6 with EcoRI 
and Xhol (SEQ ID 54). Methods for identifying compounds 
which are capable of modulating the binding of Unc-5 protein 
to the protein encoded by SEQ ID 54. 



11. Claims: 76-77 (totally) and 19-31,57 (all partially) 

A nucleic acid obtainable by digestion of pYMPll with EcoRI 
and Xhol (SEQ ID 61). Methods for identifying compounds 
which are capable of modulating the binding of Unc-5 protein 
to the protein encoded by SEQ ID 61. 
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12. Claims: 78-79 (totally) and 19-31,58 (all partially) 

A nucleic acid obtainable by digestion of pYMP12 with EcoRI 
and Xhol (SEQ ID 63). Methods for identifying compounds 
which are capable of modulating the binding of Unc-5 protein 
to the protein encoded by SEQ ID 63. 
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